CERTIFICATE OF MAILING BY "EXPRESS MAIL" 

EXPRESS MAIL LABEL NUMBER: EV 318 738 007 US 

DATE OF DEPOSIT: July 1. 2003 



I HEREBY CERTIFY THAT THIS PAPER OR FEE IS BEING DEPOSITED WITH THE UNITED STATES POSTAL 
SERVICE "EXPRESS MAIL POST OFFICE TO ADDRESSEE" SERVICE UNDER 37 CFR 1.10 ON THE DATE 
INDICATED ABOVE AND IS ADDRESSED TO THE MAIL STOP PATENT APPLICATION, COMMISSIONER FOR 
PATENTS, P.O. BOX 1450, ALEXANDRA, VA 22313-1450. 




1 



JASON BERRY 



ERRY V 



CONTINUATION 
APPLICATION 



for 



UNITED STATES LETTERS PATENT 



on 



OVEREXPRESSION OF AMINOACYL-tRNA SYNTHETASES FOR EFFICIENT 
PRODUCTION OF ENGINEERED PROTEINS CONTAINING AMINO ACffi 

ANALOGUES 

by 

David A. Tirrell 
and 

Kristi Lynn Kiick 

Sheets of Drawings: Twenty Eight (28) 
Docket No.: CIT1460-2 

Lisa A. Haile, J.D., Ph.D. 
Gray Cary Ware & Freidenrich LLP 
4365 Executive Drive, Suite 1 100 
San Diego, California 92121-2133 



DKT. 30431.6US01 SBA/KMA 



OVEREXPRESSION OF AMINOACYL-tRNA SYNTHETASES FOR EFFICIENT 
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This application is based on and claims the priority of U.S. Serial Number 60/207,627 
filed May 26, 2000, the contents of which are hereby incorporated by reference in their 
entirety. 

10 

This invention was made with Government support under NSF Grant Nos. NSF DMR- 
9996048 and US Army Research Grant DAAG55-98-1-0518. The Government has 
certain rights in this invention. 

1 5 Throughout this application various publications are referenced. The disclosures of these 
publications in their entireties are hereby incorporated by reference into this application in 
order to more fully describe the state of the art to which this invention pertains. 

FIELD OF INVENTION 

20 

The present invention relates to novel compositions and methods, for incorporating 
amino acid analogues into proteins in vivo, by overexpression of aminoacyl-tRNA 
synthetases. 

25 BACKGROUND OF INVENTION 

Expanding the scope of biological polymerizations to include non-natural monomers, is 
an area of growing interest, with important theoretical and practical consequences. An 
early and critically important example of such studies was the demonstration that 
30 "dideoxy" nucleotide monomers can serve as substrates for DNA polymerases. 
Advances in DNA sequencing (F. Sanger, S. Nicklen, A. R. Coulson, Proc. Natl. Acad 
ScL USA 1977, 14, 5463-5467), DNA base pairing models (M. J. Lutz, S. A. Benner, S. 
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Hein, G. Breipohl, E. Uhlmann, J. Am. Chem. Soc. 1997, 119, 3177-3178; J. C. Morales, 
E. T. Kool, Nature Struct. Biol. 1998, 5, 950-954), materials synthesis (W. H. Park, R. W. 
Lenz, S. Goodwin, Macromolecules 1998, 31, 1480-1486; Y. Doi, S. Kitamura, H. Abe, 
Macromolecules 1995, 28, 4822-4828), and cell surface engineering (K. J. Yarema, L. K. 
5 Mahal, R. E. Bruehl, E. C. Rodriguez, C. R. Bertozzi, J. Biol. Chem. 1998, 273, 31 168- 
31179; L. K. Mahal, K. J. Yarema, C. R. Bertozzi, Science 1997, 276, 1 125-1 128; Saxon, 
E. and Bertozzi, C:R. Science 2000, 287, 2007-2010) have resulted from the recognition 
of non-natural monomers by the enzymes that control these polymerizations. 

10 Recent investigations have shown the incorporation of modified or completely 
"synthetic" bases into nucleic acids (Matray, T.J.; Kool, E.T. Nature 1999, 399, 704; 
Kool, E.T. Biopolymers 1998, 48, 3; Morales, J.C.; Kool, E.T. Nature Struct. Biol. 1998, 
5, 950; Guckian, K.M.; Kool, E.T; Angew. Chem. Int. Ed. Eng. 1998, 36, 2825; Liu, 
D.Y.; Moran, S.; Kool, E.T. Chem. Biol. 1997, 4, 919; Moran, S.; Ren, R.X.F.; Kool, 

15 E.T Proc. Natl. Acad. Sci. USA 1997, 94, 10506; Moran, S. et al. J. Am. Chem. Soc. 
1991, 119, 2056; Benner, S.A. et al. Pure Appl. Chem. 1998, 70, 263; Lutz, M.J.; 
Horlacher J.; Benner, S.A. Bioorg. Med. Chem. Lett. 1998, 8, 1149; Lutz, M.J.; Held, 
HA.; Hottiger, M.; Hubscher, U.; Benner, S.A. Nuc. Acids Res. 1996, 24, 1308; 
Horlacher, J. et al. Proc. Natl. Acad. Sci. USA 1995, 92, 6329; Switzer, C.Y.; Moroney, 

20 S.E.; Benner, S.A. Biochemistry 1993, 32, 10489; Lutz, M.J.; Horlacher, J.; Benner, 
S.A. Bioorg. Med. Chem. Lett. 1998, 8, 499; Switzer, C; Moroney, S.E.; Benner, S.A. J. 
Am. Chem. Soc. 1989, 111, 8322; Piccirilli, J.A.; Krauch, T.; Moroney, S.E.; Benner, 
S.A. Nature 1990, 343, 33), while materials researchers have exploited the broad 
substrate range of the poly(fl-hydroxyalkanoate) (PHA) synthases to prepare novel 

25 poly(B-hydroxyalkanoate)s (PHAs) with unusual physical properties (Kim, Y.B.; Rhee, 
Y.H.; Lenz, R.W. Polym. J. 1997, 29, 894; Hazer, B.; Lenz, R.W.; Fuller, R.C. Polymer 
1996, 37, 5951; Lenz, R.W.; Kim, Y.B.; Fuller, R.C. FEMS Microbiol. Rev. 1992, 103, 
207; Park, W.H.; Lenz, R.W.; Goodwin, S. Macromolecules 1998, 31, 1480; Ballistreri, 
A. et al. Macromolecules 1995, 28, 3664; Doi, Y.; Kitamura, S.; Abe, H. 

30 Macromolecules 1995, 28, 4822). 
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Novel polymeric materials with unusual physical and/or chemical properties are also 
useful in polymer chemistry. The last several decades have shown many advances in 
synthetic polymer chemistry that provide the polymer chemist with increasing control 
over the structure of macromolecules (Szwarc, M. Nature 1956, 178, 1168-1169 Szwarc, 
5 M. Nature 1956, 778, 1168-1169; Faust, R.; Kennedy,.J.P. Polym. Bull 1986, 75, 317- 
323; Schrock, R.R. Acc. Chem. Res. 1990, 23, 158-165; Corradini, P. Macromol Symp. 
1995, 89, 1-11; Brintzinger, H.H.; Fischer, D.; Mulhaupt, R.; Rieger, B.; Waymouth, 
R.M. Angew. Chem. Int. Ed Engl. 1995, 34, 1143-1170; Dias, EX.; SonBinh, T.N.; 
Grubbs, R.H. J. Am. Chem. Soc. 1997, 77P, 3887-3897; Chiefari, J. et al. 
10 Macromolecules 1998, 31, 5559-5562). However, none of these methods have provided 
the level of control that is the basis of the exquisite catalytic, informational, and signal 
transduction capabilities of proteins and nucleic acids (Ibba, M.; Soil, D. Science 1999, 
286, 1893-1897). There remains a need for control over protein synthesis to design and 
produce artificial proteins having advantageous properties. 

15 

For this reason, the design and synthesis of artificial proteins that exhibit novel and 
potentially useful structural properties have been investigated. Harnessing the molecular 
weight and sequence control provided by in vivo synthesis would permit control of 
folding, functional group placement, and self-assembly at the angstrom length scale. 

20 Proteins that have been produced by in vivo methods exhibit predictable chain-folded 
lamellar architectures (Krejchi, M.T.; Atkins, E.D.T.; Waddon, A.J.; Fournier, M.J.; 
Mason, T.L.; Tirrell, D.A. Science 1994, 265, 1427-1432; Parkhe, A.D.; Fournier, M.J.; 
Mason, T.L.; Tirrell, D.A. Macromolecules 1993, 26(24), 6691-6693; McGrath, K.P.; 
Fournier, M.J.; Mason, T.L.; Tirrell, D.A. J. Am. Chem. Soc. 1992, 774, 727-733; Creel, 

25 H.S.; Fournier, M.J.; Mason, T.L.; Tirrell, D.A. Macromolecules 1991, 24, 1213-1214), 
unique smectic liquid-crystalline structures with precise layer spacings (Yu, S.M.; 
Conticello, V.; Zhang, G.; Kayser, C; Fournier, M.J.; Mason, T.L.; Tirrell, D.A. Nature 
1997, 389, 187-190), and controlled reversible gelation (Petka, W.A.; Hardin, J.L.; 
McGrath, K.P.; Wirtz, D.; Tirrell, D.A. Science 1998, 281, 389-392). The demonstrated 

30 ability of these protein polymers to form unique macromolecular architectures will be of 
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importance for engineering materials with interesting liquid-crystalline, crystalline, 
surface, electronic, and optical properties. 

Novel chemical and physical propierties that can be engineered into protein polymers may 
5 be expanded by the precise placement of amino acid analogues. Efforts to incorporate 
novel amino acids into proteins in vivo have relied on the ability of the translational 
apparatus to recognize amino acid analogues that differ in structure and functionality 
from the natural amino acids. The in vivo incorporation of amino acid analogues into 
proteins is controlled most stringently by the aminoacyl-tRNA synthetases (AARS), the 

10 class of enzymes that safeguards the fidelity of amino acid incorporation into proteins 
(Figure 1). The DNA message is translated into an amino acid sequence via the pairing 
of the codon of the messenger RNA (mRNA) with the complementary anticodon of the 
aminoacyl-tRNA. Aminoacyl-tRNA synthetases control the fidelity of amino acid 
attachment to the tRNA. The discriminatory power of the aminoacyl-tRNA synthetase 

15 places severe limits on the set of amino acid structures that can be exploited in the 
engineering of natural and artificial proteins in vivo. 

Several strategies for circumventing the specificity of the synthetases have been explored. 
Introduction of amino acid analogues can be achieved relatively simply via solid-phase 
20 peptide synthesis (Merrifield, R. B. Pure & Appl. Chem. 1978, 50, 643-653). While this 
method circumvents all biosynthetic machinery, the multistep procedure is limited to 
synthesis of peptides less than or equal to approximately 50 amino acids in length, and is 
therefore not suitable for producing protein materials of longer amino acid sequences. 

25 Chemical aminoacylation methods, introduced by Hecht and coworkers (Hecht, S.M. 
Acc. Chem. Res. 1992, 25, 545; Heckler, T.G.; Roesser, J.R.; Xu, C; Chang, P.; Hecht, 
S.M. Biochemistry 1988, 27, 7254; Hecht, S.M.; Alford, B.L.; Kuroda, Y.; Kitano, S. J. 
Biol Chem. 1978, 253, 4517) and exploited by Schultz, Chamberlin, Dougherty and 
others (Cornish, V.W.; Mendel, D.; Schultz, P.G. Angew. Chem. Int. Ed. Engl. 1995, 34, 

30 621; Robertson, S.A.; Ellman, J.A.; Schultz, P.G. J. Am. Chem. Soc. 1991, 113, 2722; 
Noren, CJ.; Anthony-Cahill, S.J.; Griffith, M.C.; Schultz, P.G. Science 1989, 244, 182; 
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Bain, J.D.; Glabe, C.G.; Dix, T.A.; Chamberlin, A.R. J. Am. Chem. Soc. 1989, 111, 
8013; Bain, J.D. et al. Nature 1992, 555,537; Gallivan, J.P.; Lester, H.A.; Dougherty, 
D.A. Chem. Biol. 1997, 4, 740; Turcatti, et al. J. Biol. Chem. 1996, 271, 19991; 
Nowak, M.W. et al. Science, 1995, 268, 439; Saks, M.E. et al. J. Biol. Chem. 1996, 
5 271, 23169; Hohsaka, T. et al. J. Am. Chem. Soc. 1999, 121, 34), avoid the synthetases 
altogether, but provide low protein yields. 

Alteration of the synthetase activities of the cell is also possible, either through 
mutagenesis or through introduction of heterologous synthetases (Ibba, M; Hennecke, H. 

10 FEBSLett. 1995, 364, 272; Liu, D.R.; Maghery, T.J.; Pastrnak, M.; Schultz, P.G. Proc. 
Natl. Acad. Sci. USA, 1997, 94, 10092; Furter, R. Protein Sci. 1998, 7, 419; Ohno, S. et 
al., J. Biochem. 1998, 124, 1065; Liu, D.R.; Schultz, P.G. Proc. Natl. Acad. Sci. 1999, 
96, 4780; Wang, L.; Magliery, T. J.; Liu, D. R.; Schultz, P. G. J. Am. Chem. Soc. 2000, 
122, 5010-501 1; Pastrnak, M.; Magliery, T. J.; Schultz, P. G. Helv. Chim. Acta 2000, 83, 

15 2277-2286). 

In some instances, the ability of the wild-type synthetases to accept amino acid analogues 
has been exploited. For example, wild-type synthetases have been shown to activate and 
charge substrates other than the canonical, proteinogenic amino acids (Cowie, D.B.; 

20 Cohen, G.N. Biochim. Biophys. Acta. 1957, 26, 252; Richmond, M.H. Bacteriol Rev. 
1962, 26, 398; Horton, G.; Boime, I. Methods Enzymol. 1983, 96, 111; Wilson, M.J.; 
Hatfield, D.L. Biochim. Biophys. Acta 1984, 781, 205). This approach offers important 
advantages with respect to synthetic efficiency, in that neither chemical acylation of 
tRNA nor cell-free translation is required. The simplicity of the in vivo approach, its 

25 relatively high synthetic efficiency, and its capacity for multisite substitution, make it the 
method of choice for production of protein materials whenever possible. 

The capacity of the wild-type translational apparatus has been previously demonstrated to 
utilize amino acid analogues bearing fluorinated (Richmond, M.H. J. Mol. Biol. 1963, 6, 
30 284; Fenster, E.D.; Anker, H.S. Biochemistry 1969, 8, 268; Yoshikawa, E.; Fournier, 
M.J.; Mason, T.L.; Tirrell, D.A. Macromolecules 1994, 27, 5471), unsaturated (Van 
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Hest, J.C.M.; Tirrell, D.A. FEES Lett. 1998, 428, 68; Deming, T.J.; Fournier, MJ.; 
Mason, T.L.; Tirrell, D.A. J. Macromol Sci. - Pure Appl. Chem. 1997, A34, 2134), 
electroactive (Kothakota, S.; Mason, T.L.; Tirrell, DA.; Fournier, M.J. J. Am. Chem. 
Soc. 1995, 777, 536), and other useful side chain functions. The chemistries of the above 
5 functional groups are distinct from the chemistries of the amine, hydroxyl, thiol, and 
carboxylic acid functional groups characteristic of proteins; this makes their 
incorporation particularly attractive for targeted chemical modification of proteins. 

For example, alkene functionality introduced into artificial proteins via dehydroproline 
10 can be quantitatively modified via bromination and hydroxylation (Deming, T.J.; 
Fournier, M.J.; Mason, T.L.; Tirrell, D.A. J. Macromol. Sci. Pure Appl. Chem. 1997, 
A34, 2143-2150). Alkene functionality, introduced by incorporation of other amino acid 
analogues, should be useful for chemical modification of proteins by olefin metathesis 
(Clark, T.D.; Kobayashi, K.; Ghadiri, M.R. Chem. Eur. J. 1999, J, 782-792; Blackwell, 
15 H.E.; Grubbs, R.H. Angew. Chem. Int. Ed. Engl. 1998,. 37, 3281-3284), palladium- 
catalyzed coupling (Amatore, C.; Jutand, A. J. Organomet. Chem. 1999, 576, 255-277; 
Tsuji, J. Palladium Reagents and Catalysts: Innovations in Organic Synthesis', John 
Wiley and Sons: New York, 1995; Schoenberg, A.; Heck, R.F. J. Org. Chem. 1974, 39, 
3327-3331), and other chemistries (Trost, B.M.; Fleming, I., Eds. Comprehensive 
20 Organic Synthesis; Pergamon Press: Oxford, 1991). The incorporation of fluorinated 
functional groups into proteins has imparted to protein films the low surface energy 
characteristic of fluoropolymers; contact angles of hexadecane on fluorinated protein 
polymers (70°) are much higher than those on unfluorinated controls (17°) (Yoshikawa, 
E.; Fournier, M.J.; Mason, T.L.; Tirrell, D.A. Macromolecules 1994, 27, 5471-5475). 

25 

Methionine (1) (Figure 1) is a possible target for substitution by amino acid analogues, 
with its hydrophobicity and polarizability, make it an important amino acid for regulating 
protein structure and protein-protein recognition processes (T. Yuan, A. M. Weljie, H. J. 
Vogel, Biochemistry 1998, 37, 3187-3195; H. L. Schenck, G. P. Dado, S. H. Gellman, J. 
30 Am. Chem. Soc. 1996, 118, 12487-12494; Maier, K.L.; Lenz, A.G., Beck-Speier, I.; 
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Costabel, U. Methods Enzymol 1995, 251, 455-461). Replacement of methionine by its 
analogues may therefore permit purposeful manipulation of these properties. 

Several analogues of methionine (1), specifically selenomethionine, telluromethionine, 
5 norleucine, trifluoromethionine and ethionine (Hendrickson, W.A.; Horton, J.R.; 
Lemaster, D.M. EMBOJ. 1990, 9, 1665; Boles, J.O. et al Nature Struct. Biol. 1994, 1, 
283; Cowie, D.B.; Cohen, G.N.; Bolton, E.T.; de Robichon-Szulmajster, H. Biochim. 
Biophys. Acta 1959, 34, 39; Duewel, H.; Daub, E.; Robinson, R.; Honek, J.F. 
Biochemistry 1997, 36, 3404; Budisa, N.; Steipe, B.; Demange, P.; Eckerskorn, G; 
10 Kellerman, J.; Huber, R. Eur. J, Biochem. 1995, 230, 788), have been shown to exhibit 
translational activity in bacterial hosts. Incorporation of selenomethionine in place of 
methionine has long been known to facilitate protein structure determination by x-ray 
crystallography (Wei, Y.; Hendrickson, W.A.; Crouch, R.J.; Satow, Y. Science 1990, 
249, 1398-1405). 

15 

However, only a limited number of amino acid analogues have been shown to 
conclusively exhibit translational activity in vivo, and the range of chemical functionality 
accessible via this route remains modest. These circumstances dictate a need for a 
systematic search for new amino acid analogues and strategies that will allow, the 
20 engineering of proteins with novel chemical and physical properties. 

SUMMARY OF INVENTION 

The present invention seeks to overcome these and other disadvantages in the prior art by 
25 providing a novel method for incorporating amino acid analogues into polypeptides of 
interest in vivo by expanding the scope of amino acid analogues that are incorporated and 
increasing protein yields. Preferably, the production of modified polypeptides can be in a 
host- vector system in which a natural amino acid in the wild-type polypeptide is replaced 
with a selected amino acid analogue by overexpressing an aminoacyl-tRNA synthetase 
30 corresponding to the natural amino acid so replaced. 
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In addition, the present invention provides novel host-vector systems. The host-vector 
system produces an aminoacyl-tRNA synthetase in an amount in excess of the level of a 
naturally occurring aminoacyl-tRNA synthetase. The system also produces a polypeptide 
of interest in an amount in excess of the level produced by a naturally occurring gene 
5 encoding the polypeptide of interest. 

Nucleic acids encoding the expression vectors, hosts, and methods of integrating a 
desired amino acid analogue into target polypeptides are also provided. 

10 The invention further provides purified dihydrofolate reductase polypeptides, produced 
by the methods of the invention, in which the methionine residues have been replaced 
with homopropargylglycine (2-amino-hexynoic acid), homoallylglycine (2-amino- 
hexenoic acid), m-crotylglycine (cis-2-amino-4-hexenoic acid), frww-crotylglycine 
(7ra«^-2-amino-4-hexenoic acid), norleucine, 6,6,6-trifluoro-2-amino hexanoic acid, 2- 

15 amino-heptanoic acid, norvaline, o-allylserine, 2-butynylglycine, allylglycine or 
propargylglycine. The formation of the modified polypeptides demonstrate the ease and 
efficiency of the methods of the invention for incorporating amino acid analogues such 
as, methionine analogues, into proteins such as, dihydrofolate reductase. 

20 Using the methods of the invention, it is possible to produce entirely new polypeptides 
containing amino acid analogues having unusual properties. 

BRIEF DESCRIPTION OF THE FIGURES 

25 Figure 1 depicts a schematic diagram of in vivo protein synthesis. 

Figure 2 depicts a set of methionine analogues (2-13), as described in Example I, infra. 

Figure 3 illustrates the SDS-PAGE analysis of mDHFR synthesis by E. coli strain 
30 CAG18491/pREP4/pQE15, as described in Example I, infra. Cultures were 
supplemented with methionine or with one of the analogues 2 - 9, as indicated. Each lane 
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is identified in terms of the time of analysis subsequent to addition of the inducer IPTG. 
mDHFR is visualized by staining with Coomassie Brilliant Blue. The target protein can 
be detected only in cultures supplemented with methionine or with analogues 2, 3, or 9, 
respectively. 

5 

Figure 4 shows the determination of the occupancy of the initiator site in: a). mDHFR, b). 
mDHFR-E (alkene) and c). mDHFR- Y(alkyne), as described in Example I, infra. 
Chromatograms are shown for analysis of the N-terminal residue in each of the three 
proteins, as determined via Edman degradation. The signals corresponding to 
10 methionine, 2 and 3 elute at 12.3, 14.3 and 11.0 min, respectively. The strong signal at 
13.8 min is due to piperidylphenylthiourea, a by-product of the analysis. Signals 
assigned to 2 and 3 were verified by analysis of authentic samples of the analogues. 

Figure 5 depicts the activation of methionine and methionine analogues by MetRS 
15 (Methionyl tRNA synthetase), as described in Example I, infra. The amount of PPj 
exchanged in 20 minutes is shown for methionine (1) and for methionine analogues 2 - 
13. The background (14) is given for a reaction mixture lacking both enzyme and amino 
acid. 

20 Figure 6 illustrates the electron density maps (colored surfaces) and negative isopotential 
surfaces (meshes) for methionine (a) and for analogues 2, 3 and 5 (b - d, respectively), as 
described in Example I, infra. The electron density maps indicate electron-rich (red) and 
electron-poor (blue) regions of each molecule. For simplicity, the amino acid form is 
shown; this avoids representation of the highly extended isopotential surface of the 

25 carboxylate anion of the zwitterion and facilitates comparison of side-chain electronic 
structure. 

Figure 7 shows the SDS-PAGE analysis of mDHFR synthesis by E. coli strains 
B834(DE3)/pQE15/pREP4 (designated pQE15) and B834(DE3)/pQE15-MRS/pREP4 
30 (designated pQE15-MRS), as described in Example II, infra. Cultures (M9 + 19AA) 
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were supplemented with nothing (-Met), methionine (Met) or trans-crotylglycine (60 
mg/L) (Tcg) 5 as indicated. 

Figure 8 depicts the SDS-PAGE analysis of DHFR synthesis by E. coli strains 
5 CAG18491/pQE15/pREP4 and CAG18491/pQE15-MRS/pREP4, as described in 
Example II, infra. Cultures (M9 + 19AA) were supplemented with nothing (-Met), 
methionine (+ Met), or 2-butynylglycine (60 mg/L) (+ 2bg), as indicated. 

Figure 9 shows the SDS-PAGE analysis of mDHFR synthesis by E coli strains 
10 CAG18491/pQE15/pREP4 (the left panel in each pair, pQE15) and CAG18491/pQE15- 
MRS/pREP4 (the right panel in each pair, pQE15-MRS), as described in Example II, 
infra. Cultures (M9 + 19AA) were supplemented with the analogues 4, 6, 7, 8, 10, 12, 
and 13 at 500 mg/1, as indicated. Negative (- Met) and positive (+ Met) controls of 
CAG18491/pQE15-MRS/pREP4 cultures are also shown for comparison 

15 

Figure 10 is a table detailing kinetic parameters for methionine analogues in the ATP-PPj 
exchange reaction and analogue's ability to support protein biosynthesis in cultures of a 
conventional bacterial host supplemented with the analogues, as described in Example II, 
infra. 

20 

Figure 1 1 illustrates the activation rates of methionine by whole cell lysates, as described 
in Example II, infra. Maximum ATP-PPi exchange velocities, measured at a saturating 

concentration of methionine (750 pM), are shown for whole cell lysates of 
B834(DE3)/pQE15/pREP4 (solid) and of B834(DE3)/pQE15-MRS/pREP4 (striped). 
25 Rates were measured for (a) cell lysates obtained from cultures prior to protein 
expression, (b) cell lysates obtained from cultures supplemented with methionine during 
protein expression, and (c) cell lysates obtained from cultures supplemented with 
transcrotylglycine during protein expression. 

30 Figure 12 shows the Proton NMR spectra (599.69 MHz) of (a) mDHFR, (b) Teg, (c and 
d) mDHFR-Tcg, as described in Example II, infra. Samples were dissolved at 
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concentrations of approximately 10 mg/ml in D2O containing 2% rf-formic-(i-acid and 
spectra were collected at 25 °C overnight. 



Figure 13 depicts the N-terminal sequencing results indicating occupancy of the initiator 
5 site in mDHFR-Tcg, as described in Example II, infra. Chromatograms are shown for (a) 
the N-terminal residue of mDHFR, (b) Teg control, and (c) the N-terminal residue of 
mDHFR-Tcg, as determined via Edman degradation. 

Figure 14 depicts the N-terminal sequencing results indicating occupancy of the initiator 
10 site in mDHFR-bg, as described in Example II, infra. Chromatograms are shown for (a) 
the N-terminal residue of mDHFR, (b) 2bg control, and (c) the N-terminal residue of 
mDHFR-2bg, as determined via Edman degradation. 

Figure 15 is a table of the kinetic parameters for methionine analogues in the ATP-PPj 
15 exchange reaction and protein yields for bacterial cultures supplemented with the 
analogues, as described in Example III, infra. 

Figure 16 shows the comparison of the kinetic parameters for methionine analogues in 
the ATP-PPj exchange reaction and relative protein yields from conventional bacterial 
20 host cultures supplemented with the analogues, as described in Example III, infra. 

Figure 17 depicts the Western blot analysis of protein synthesis by bacterial expression 
hosts CAG18491/pQE15/pREP4 (pQE15) and CAG18491/pQE15-MRS/pREP4 (MRS). 
Bacterial cultures were supplemented with methionine, 2, 3 or 9, as described in Example 
25 III infra. 

Figure 18 illustrates the activation (a) and aminoacylation (b) steps of amino acid 
attachment to tRNA, as described in Example III, infra. 

30 Figure 19 depicts the sequence of pQE15-MRS (SEQ ID NO.: 1). 
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Figure 20 depicts the sequence of pQE15-W305F (SEQ ID NO.: 2) 
DETAILED DESCRIPTION OF THE INVENTION 



5 As used in this application, the following words or phrases have the meanings specified. 
Definitions 

As used herein, a polypeptide refers to a peptide or protein having natural amino acids. 

10 

As used herein, modified polypeptides are polypeptides having amino acid analogues 
incorporated into their amino acid sequence. 

As used herein, a "natural amino acid" is one of the 20 naturally occuring amino acids, 
15 namely glycine, alanine, valine, leucine, isoleucine, serine, threonine, aspartic acid, 
glutamic acid, asparagine, glutamine, lysine, arginine, cysteine, methionine, 
phenylalanine, tyrosine, tryptophan, histidine and proline. 

As used herein, the term "amino acid analogue" refers to a compound that has a structure 
20 analogue to a natural amino acid but mimics the structure and/or reactivity of a natural 
amino acid. This includes all amino acids but the natural 20 amino acids are referred to 
as amino acid analogues even if they are naturally present (e.g. hydroxyproline). 

As used herein, the term "peptide" refers to a class of compounds composed of amino 
25 acids chemically bound together with amide linkages (CONH). Peptide as used herein 
includes oligomers of amino acids and small and large peptides, including polypeptides. 

As used herein, "polypeptides" embrace all peptides and those polypeptides generally 
defined as proteins and also those that are glycosylated, e.g. glycoproteins. 

30 
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METHODS OF THE INVENTION 



The present invention is based on the discovery that incorporation of amino acid 
analogues into polypeptides can be improved in cells that overexpress aminoacyl-tRNA 
5 synthetases that recognize amino acid analogues as substrates. "Improvement" is defined 
as either increasing the scope of amino acid analogues (i.e. kinds of amino acid 
analogues) that are incorporated or by increasing the yield of the modified polypeptide. 
Overexpression of the aminoacyl-tRNA synthetase increases the level of aminoacyl- 
tRNA synthetase activity in the cell . The increased activity leads to an increased rate of 
10 incorporation of amino acid analogues into the growing peptide, thus the increased rate of 
synthesis of the polypeptides, thereby increasing the quantity of polypeptides containing 
amino acid analogues, i.e. modified polypeptides, produced. 

In general, the methods of the invention comprises introducing into a host cell, a vector 
1 5 having nucleic acids encoding an aminoacyl-tRNA synthetase, and nucleic acids encoding 
a polypeptide of interest to produce a host-vector system. The nucleic acids, encoding the 
aminoacyl-tRNA synthetase, and the nucleic acids encoding the polypeptide of interest, 
may be located in the same or different vectors. The vectors include expression control 
elements which direct the production of the aminoacyl-tRNA synthetase, and the 
20 polypeptide of interest. The expression control elements (i.e. regulatory sequences) can 
include inducible promotors, constitutive promoters, secretion signals, enhancers, 
transcription terminators, and other transcriptional regulatory elements. 

In the host-vector system, the production of an aminoacyl-tRNA synthetase can be 
25 controlled by a vector which comprises expression control elements that direct the 
production of the aminoacyl-tRNA synthetase. Preferably, the production of aminoacyl- 
tRNA synthetase is in an amount in excess of the level of naturally occurring aminoacyl- 
tRNA synthetase, such that the activity of the aminoacyl-tRNA synthetase is greater than 
naturally occuring levels. 

30 



13 



In the host-vector system, the production of a polypeptide of interest can be controlled by 
a vector which comprises expression control elements for producing the polypeptide of 
interest. Preferably, the polypeptide of interest so produced is in an amount in excess of 
the level produced by a naturally occurring gene encoding the polypeptide of interest. 

5 

The host-vector system can be constitutively overexpressing the aminoacyl-tRNA 
synthetase and induced to overexpress the polypeptide of interest by contacting the host- 
vector system with an inducer, such as isopropyl-p-D-thiogalactopyranoside (IPTG). The 
host-vector system can also be induced to overexpress the aminoacyl-tRNA synthetase 
10 and/or the protein of interest by contacting the host- vector system with an inducer, such 
as IPTG. Other inducers include stimulation by an external stimulation such as heat 
shock. 

Using the methods of the invention, any natural amino acid can be selected for 
15 replacement by an amino acid analogue in the polypeptide of interest. An amino acid 
analogue is preferably an analogue of the natural amino acid to be replaced. To replace a 
selected natural amino acid with an amino acid analogue in a polypeptide of interest, an 
appropriate corresponding aminoacyl-tRNA synthetase must be selected. For example, if 
an amino acid analogue will replace a methionine residue, then preferably a methionyl 
20 tRNA synthetase is selected. 

The host-vector system is grown in media lacking the natural amino acid and 
supplemented with an amino acid analogue, thereby producing a modified polypeptide 
that has incorporated at least one amino acid analogue. This method is superior to 
25 existing methods as it improves, the efficiency of incorporation of amino acid analogues 
into polypeptides of interest and increases the quantity of modified polypeptides so 
produced. 

In an embodiment of the invention, where the host-vector system is an auxotrophic 
30 system, the host-vector system is initially grown in media which includes all essential 
amino acids, induced to express the polypeptide of interest, and subsequently after 
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induction, is grown in media lacking the natural amino acid and supplemented with an 
amino acid analogue, thereby producing a modified polypeptide that has incorporated at 
least one amino acid analogue. 

5 For example, the method of the invention can be practiced by: (1) growing the host- 
vector system under suitable conditions having the natural amino acid and under 
conditions such that the host-vector system overexpresses the aminoacyl-tRNA synthetse; 
(2) collecting and washing cells to remove presence of the natural amino acid; (3) 
resuspending the cells in media medium which lacks the natural amino acid and has an 
10 amino acid analogue; (4) inducing the expression of the polypeptide of interest; (5) 
growing the cells in a medium which lacks the natural amino acid and has an amino acid 
analogue under conditions such that the host-vector system overexpresses the aminoacyl- 
tRNA synthetase and the polypeptide molecule of interest; and (6) isolating the modified 
polypeptide of interest. 

15 

In an embodiment of the invention, the polypeptide of interest is dihydro folate reductase, 
the natural amino acid is methionine, the aminoacyl-tRNA synthetase is methionyl tRNA 
synthetase, and the amino acid analogues of methionine are 6,6,6-trifluoromethionine, 
homoallyglycine, homoproparglycine, norvaline, norleucine, cw-crotylglycine, trans- 
20 crotylglycine, 2-aminoheptanoic acid, 2-butynylglycine, allylglycine, azidoalanine and 
azidohomoalanine. 

Polypeptides of Interest 

25 In accordance with the invention, the polypeptides may be from any source whether 
natural, synthetic, semi-synthetic, or recombinant. These include hormones, enzymes 
and protein fibers. Of these proteins, well-known examples are insulin, interferons, 
growth hormones, serum albumin and epidermal growth factor. 

30 The polypeptides of interest can be those which wild-type cells cannot naturally produce. 
In view of the diversity of the modified polypeptides that can be produced using the 
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methods of the invention, it is preferable that the polypeptide of interest be different from 
those produced by wild type cells. 

Natural Amino Acids 

5 

Natural amino acids are amino acid residues that will be replaced in a polypeptide of 
interest by a desired amino acid analogue using the methods of the invention. 

Amino acids constituting a natural amino acid residue may be selected from the 20 
10 natural amino acids, namely glycine, alanine, valine, leucine, isoleucine, serine, 
threonine, aspartic acid, glutamic acid, asparagine, glutamine, lysine, arginine, cysteine, 
methionine, phenylalanine, tyrosine, tryptophan, histidine and proline, that constitute the 
amino acid sequence of a polypeptide of interest. 

15 Aminoacyl-tRNA synthetases 

Aminoacyl-tRNA synthetases can be from any source whether natural, synthetic, semi- 
synthetic or recombinant (mutated or genetically engineered). Accordingly, the 
aminoacyl-tRNA synthetases can be from any eukaryotic or prokaryotic cell. Aminoacyl- 

20 tRNA synthetases can have originated from the same or different cell as the host cell. 
Types of aminoacyl-tRNA synthetases can include but are not limited to glycine, alanine, 
valine, leucine, isoleucine, serine, threonine, aspartic acid, glutamic acid, asparagine, 
glutamine, lysine, arginine, cysteine, methionine, phenylalanine, tyrosine, tryptophan, 
histidine and proline t-RNA synthetases. In accordance with the invention, selection of 

25 an appropriate aminoacyl-tRNA synthetase depends on the natural amino acid so selected 
to be replaced by an amino acid analogue. For example, if an amino acid analogue will 
replace methionine, then a methionyl tRNA synthetase is used. 

It may be possible to use genetically engineered aminoacyl-tRNA synthetases that 
30 recognize amino acid analogues and are able to facilitate the incorporation of that amino 
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acid analogue into a polypeptide. For example, hydroxy acids can be incorporated to 
form an ester linkage in place of an amide linkage of polypeptides. 

Aminoacyl-tRNA synthetases can be mutated or genetically engineered to enhance 
5 properties of the enzyme to facilitate the incorporation of the amino acid analogues into 
polypeptides of interest. For example, the editing function of the aminoacyl-tRNA 
synthetases can be eliminated. 

Nucleic acid seqences encoding the appropriate aminoacyl-tRNA synthetase are used in 
1 0 the methods of the invention. 

Amino acid analogues 

The amino acid analogues incorporated into polypeptides using the methods of . this 
1 5 invention are different from the twenty naturally occurring counterparts in their side chain 
functionality. The amino acid analogue can be a close analogue of one of the twenty natural 
amino acids, or it can introduce a completely new functionality and chemistry. The amino 
acid analogue can replace an existing amino acid in a protein (substitution). 

20 There may be a variety of amino acid analogues that can be added to a medium according 
to the present invention. Suitable amino acid analogues include, but are not limited to, 
molecules having fluorinated, electroactive, conjugated, azido, carbonyl, alkyl and 
unsaturated side chain functionalities. The following are representative examples of amino 
acid analogues: 

25 

Amino acid analogues which are modifications of natural amino acids in the side chain 
functionality, such that the imino groups or divalent non-carbon atoms such as oxygen or 
sulfur of the side chain of the natural amino acids have been substituted by methylene 
groups, or, alternatively, amino groups, hydroxyl groups or thiol groups have been 
30 substituted by methyl groups, olefin, or azido groups, so as to eliminate their ability to 
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form hydrogen bonds, or to enhance their hydrophobic properties (e.g. methionine to 
norleucine). 

Amino acid analogues which are modifications of natural amino acids in the side chain 
5 functionality, such that the methylene groups of the side chain of the natural amino acids 
have been substituted by imino groups or divalent non-carbon atoms or, alternatively, 
methyl groups have been substituted by amino groups, hydroxyl groups or thiol groups, 
so as to add ability to form hydrogen bonds or to reduce their hydrophobic properties 
(e.g. leucine to 2-aminoethylcysteine, or isolecine to o-methylthreonine). 

10 

Amino acid analogues which are modifications of natural amino acids in the side chain 
functionality, such that a methylene group or methyl groups have been added to the side 
chain of the natural amino acids to enhance their hydrophobic properties (e.g. Leucine to 
gamma-Methylleucine, Valine to beta-Methylvaline (r-Leucine)). 

15 

Amino acid analogues which are modifications of natural amino acids in the side chain 
functionality, such that a methylene groups or methyl groups of the side chain of the 
natural amino acids have been removed to reduce their hydrophobic properties (e.g. 
Isoleucine to Norvaline). 

20 

Amino acid analogues which are modifications of natural amino acids in the side chain 
functionality, such that the amino groups, hydroxyl groups or thiol groups of the side 

Q 

chain of the natural amino acids have been removed or methylated to eliminate their 
ability to form hydrogen bonds (e.g.Threonine to o-methylthreonine or Lysine to 
25 Norleucine). 

Optical isomers of the side chains of natural amino acids (e.g. Isoleucine to 
Alloisoleucine); 

30 Amino acid analogues which are modifications of natural amino acids in the side chain 
functionality, such that the substituent groups have been introduced as side chains to the 
natural amino acids (e.g. Asparagine to beta-fluoroasparagine). 
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Amino acid analogues which are modifications of natural amino acids where the atoms of 
aromatic side chains of the natural amino acids have been replaced to change the 
hydrophobic properties, electrical charge, fluorescent spectrum or reactivity (e.g. 
5 Phenylalanine to Pyridylalanine, Tyrosine to p-Aminophenylalanine). 

Amino acid analogues which are modifications of natural amino acids where the rings of 
aromatic side chains of the natural amino acids have been expanded or opened so as to 
change hydrophobic properties, electrical charge, fluorescent spectrum or reactivity (e.g. 
1 0 Phenylalanine^ Naphthylalanine, Phenylalanine to Phenylalanine). 

Amino acid analogues which are modifications of the natural amino acids in which the 
side chains of the natural amino acids have been oxidized or reduced so as to add or 
remove double bonds (e.g. Alanine to Dehydroalanine, Isoleucine to Beta- 
15 methylenenorvaline). 

Amino acid analogues which are modifications of proline in which the five-membered 
ring of proline has been opened or, additionally, substituent groups have been introduced 
(e.g. Proline to N-methylalanine). 

20 

Amino acid analogues which are modifications of natural amino acids in the side chain 
functionality, in which the second substituent group has been introduced at the alpha- 
position (e.g. Lysine to alpha-difluoromethyllysine). 

25 Amino acid analogues which are combinations of one or more alterations, as described 
supra (e.g. Tyrosine to p-Methoxy-m-hydroxyphenylalanine), 

Amino acid analogues which differ in chemical structures from natural amino acids but 
can serve as substrates for aminoacyl-tRNA synthetase by assuming a conformation 
30 analogous to natural amino acids when bound to this enzyme, (e.g. Isoleucine to 
Furanomycin) 
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Types of amino acid analogues of methionine are 6,6,6-trifluoromethionine, 
homoallyglycine, homoproparglycine, norvaline, norleucine, c/s-crotylglycine, trans- 
crotylglycine, 2-aminoheptanoic acid, 2-butynylglycine, allylglycine, azidoalanine and 
5 azidohomoalanine. 

Vectors 

In accordance with the methods of the invention, suitable expression vectors which may 
10 be used include, but are not limited to, viral particles, baculovirus, phage, plasmids, 
phagemids, cosmids, phosmids, bacterial artificial chromosomes, viral DNA (e.g. 
vaccinia, adenovirus, foul pox virus, pseudorabies and derivatives of SV40), PI -based 
artificial chromosomes, yeast plasmids, yeast artificial chromosomes, and any other 
vectors specific for specific hosts of interest (such as bacillus, aspergillus, yeast, etc.) 
1 5 Such vectors can be chromosomal, nonchromosomal or synthetic DNA sequences. 

Large numbers of suitable vectors are known to those of skill in the art, and are 
commercially available. The following vectors are provided by way of example; 
Bacterial: pQE70, pQE60, pQE-9, pQE15 (Qiagen, Valencia, CA), psiX174, pBluescript 
20 SK, pBluescript KS, (Stratagene, La Jolla, CA); pTRC99a, pKK223-3, pKK233-3, 
pDR540, pRIT2T (Pharmacia, Uppsala, Sweden); Eukaryotic: pWLNEO, pXTl, pSG 
(Stratagene, La Jolla, CA) pSVK3, pBPV, PMSG, pSVLSV40 (Pharmacia, Uppsala, 
Sweden). 

25 A preferred vector for expression may be an autonomously replicating vector comprising 
a replicon that directs the replication of the nucleic acids within the appropriate host cell. 
The preferred vectors also include an expression control element, such as a promoter 
sequence, which enables transcription of the inserted sequences and can be used for 
regulating the expression (e.g., transcription and/or translation) of an operably linked 

30 sequence in an appropriate host cell such as Escherichia coll Methods for generating 
vectors are well known in the art, for example, see Maniatis, T., et al., 1989 Molecular 
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Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 
incorporated by reference herein. 

Expression control elements are known in the art and include, but are not limited to, 
5 inducible promoters, constitutive promoters, secretion signals, enhancers, transcription 
terminators, and other transcriptional regulatory elements. Other expression control 
elements that are involved in translation are known in the art, and include the Shine- 
Dalgarno sequence, and initiation and termination codons. The preferred vector also 
includes at least one selectable marker gene that encodes a gene product that confers drug 
10 resistance, such as resistance to ampicillin or tetracyline. The vector also comprises 
multiple endonuclease restriction sites that enable convenient insertion of exogenous 
DNA sequences. 

The preferred vectors for generating polypeptides of interest are those compatible to 
15 prokaryotic host cells. Prokaryotic cell expression vectors are well known in the art and are 
available from several commercial sources. For example, a pQE vector (e.g., pQE15, 
available from Qiagen Corp., Valencia, CA) may be used to express polypeptides of 
interest, containing natural amino acids and modified polypeptides, including those 
containing amino acid analogues, in bacterial host cells. 

20 

The nucleic acids derived from a microorganism(s) may be inserted into the vector by a 
variety of procedures. In general, the nucleic acids can be inserted into an appropriate 
restriction endonuclease site(s) by procedures known in the art. Such procedures and 
others are deemed to be within the scope of those skilled in the art. 

25 

The nucleic acid sequence encoding the aminoacyl-tRNA synthetase or polypeptide of 
interest in the expression vector may be operatively linked to an appropriate expression 
control sequence(s) (promoter) to direct mRNA synthesis. Bacterial promoters include 
lad, lacZ, T3, T7, gpt, lambda P R , P L and trp. Eukaryotic promoters include CMV 
30 immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
mouse metallothionein-I. 
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Selection of the appropriate vector and its correlative promoter is well within the level of 
ordinary skill in the art. The expression vector may also contain a ribosome binding site 
for translation initiation and a transcription terminator. The vector may also include 
5 appropriate sequences for amplifying expression. Promoter regions can be selected from 
any desired gene using CAT (chloramphenicol transferase) vectors or other vectors with 
selectable markers. Chemical or temperature sensitive promotors can be used for 
inducing the expression of either the aminoacyl-tRNA synthetase or the target protein. 

10 In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenptypic trait for selection of transformed host cells, such as 
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or tetracycline 
or ampicillin resistance in E. coli. 

15 Inducers 

In accordance with the methods of the invention when the expression control element is 
an inducible promotor, the promoter may be induced by an external stimulus, such as by 
adding a compound (e.g. IPTG) or by heat shocking to initiate the expression of the gene. 

20 

Level of Expression 

In accordance with the methods of the invention, the production of the aminoacyl-tRNA 
synthetase and/or the polypeptide of interest is preferably in an amount in excess of the 
25 level (any increase that is meaningful or confers a benefit) produced by a naturally 
occurring gene encoding the aminoacyl-tRNA synthetase and/or the polypeptide of 
interest. 

The increase in the level of aminoacyl-tRNA synthetase and/or the polypeptide of interest 
30 can be measured by monitoring an increase in protein expression by gel electrophoresis, 
western blot analysis, or other relevant methods of protein detection (Maniatis, T., et al., 



22 



1989 Molecular Cloning, A Laboratory Manual Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY). 

The increase in the level of aminoacyl-tRNA synthetase can also be determined by 
5 measuring the ATP-PPj exchange activity (Mellot, P.; Mechulam, Y.; LeCorre, D.; 
Blanquet, S.; Fayat, G. J. Mol Biol 1989, 205, 429; Blanquet, S.; Fayat, G.; Waller, J.- 
P. Eur, 1 Biochem. 1974, 44, 343; Ghosh, G.; Pelka, H.; Schulman, L. H. Biochemistry 
1990, 29, 2220) of cell lysates. 

10 Fusion genes 

In accordance with the methods of the invention, a ftision gene includes a sequence 
encoding a polypeptide of the invention operatively fused (e.g., linked) to a non-related 
sequence such as, for example, a tag sequence to facilitate isolation and/or purification of 
15 the expressed gene product (Kroll, D.J., et ah, 1993 DNA Cell Biol 12:441-53). The pQE 
expression vectors used in this invention express proteins fused to a poly-Histidine tag 
that facilitates isolation and/or purification of the expressed gene. 

Host Cells 

20 

In accordance with the methods of the invention, types of host cells include, but are not 
limited to, bacterial cells, such as E. coli, Streptomyces, Salmonella typhimurium; fungal 
cells, such as yeast; insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells 
such as CHO, COS or Bowes melanoma; adenoviruses; plant cells, etc. The selection of 
25 an appropriate host is deemed to be within the scope of those skilled in the art from the 
teachings herein. 

A preferred embodiment of a host cell is an auxotroph. Auxotrophs depend upon the 
external environment to supply certain amino acids, for example, a methionine auxotroph 
30 depends on methionine in the growth medium for its survival The choice of auxotroph is 
dependent on the amino acid that is selected to be replaced by an amino acid analogue in 
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the target protein (e.g. if methionine is selected, then a methionine auxotroph is 
employed, if phenylalanine is selected, then a phenylalanine auxotroph is employed). 

Suitable auxotrophs include, but are not limited to CAG18491, B834(DE3), AD494, 
5 DL41,andML304d. 

Host cells may be either wild type cells or transformants. The term "transformants' 1 as 
used herein including products of transformation, transfection and transduction. 
Preferably, the polypeptides of interest to be produced by the cells according to the 
10 present invention are those which wild-type cells cannot produce. Thus, it is preferable 
that the cells to be used in the present invention be transformants. 

Host- Vector Systems 

15 The invention further discloses a host-vector system comprising a vector or vectors 
having nucleic acids encoding the aminoacyl-tRNA synthetase and polypeptide of interest. 

The host- vector system is used to produce the polypeptides of interest. The host cell can be 
either prokaryotic or eukaryotic. Examples of suitable prokaryotic host cells include 
20 bacterial strains from genera such as Escherichia, Bacillus, Pseudomonas, Streptococcus, 
and Streptomyces. Examples of suitable eukaryotic host cells include a yeast cell, a plant 
cell, or an animal cell, such as a mammalian cell. 

Introduction of the vectors of the present invention into an appropriate cell host is 
25 accomplished by well known methods that typically depend on the type of vector used and 
host system employed. For transformation of prokaryotic host cells, electroporation and salt 
treatment methods are typically employed, see for example, Cohen et al., 1972 Proc Acad 
Sci USA 69:2110; Maniatis, T., et al., 1989 Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY. Transformation of vertebrate cells 
30 with vectors by electroporation, cationic lipid or salt treatment methods, is typically 
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employed, see, for example, Graham et al, 1973 Virol 52:456; Wigler et al., 1979 Proc 
Natl Acad Sci USA 76:1373-76. 

Successfully transformed host cells, i.e;, cells that contain a vector of the present invention, 
5 are identified by well-known techniques. For example, cells resulting from the introduction 
of a vector of the present invention are selected and cloned to produce single colonies. Cells 
from those colonies are harvested, lysed and their nucleic acid content examined for the 
presence of the vector using a method such as that described by Southern, JMol Biol (1975) 
98:503, or Berent et al., Biotech (1985) 3:208, or the proteins produced from the cell are 
1 0 assayed via a biochemical assay or immunological method such as Western blotting. 

The methods of the invention in which the cloned gene is expressed in a suitable host 
cell, are preferred if longer polypeptides, higher yield, or a controlled degree of amino 
acid analogue incorporation is desired. For example, a suitable host cell is introduced 
15 with an expression vector having the nucleotide sequence encoding the polypeptide of 
interest. The host cell is then cultured under conditions that permit in vivo production of 
the desired polypeptide, wherein one or more naturally occurring amino acids in the 
desired polypeptide are replaced with the amino acid analogue analogues and derivatives. 

20 A preferred embodiment provides a host- vector system comprising the pQE15 (Qiagen, 
Santa Clara, CA) vector having a sequence encoding the the aminoacyl-tRNA synthetase 
and target polypeptide of invention, which is introduced along with the pREP4 (Qiagen) 
vector into an appropriate auxotrophic host cell such as E. coli methionine auxotroph 
CAG 18491 strain, which is useful, for example, for producing a polypeptide where a 

25 selected natural amino acid is replaced with a amino acid analogue. 

An embodiment of the host cells of the present invention are Escherichia coli and 
transformants thereof, and an example of the protein to be produced is dihydrofolate 
reductase. 
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Media 



Suitable media for growing the host- vector systems of the invention are well known in 
the art, for example, see Sambrook et al., Molecular Cloning (1989), supra. In general, a 
5 suitable media contains all the essential nutrients for the growth of the host- vector 
system. The media can be supplemented with antibiotics that are selected for host-vector 
system. 

The media may contain all 20 natural amino acids or lack a selected natural amino acid. 
10 The media may also contain an amino acid analogue in place of a selected natural amino 
acid. 

Potential Uses of Modified Polypeptides 

15 According to the present invention, it is now possible to create entirely new modified 
polypeptides, in which amino acid analogues, as well as the 20 natural amino acids are 
used as constituents. 

Modified polypeptides can be used to prepare functional drugs, antagonistic drugs or 
20 inhibitory agents. Also, using non-natural amino acids in protein engineering expands 
the potential designs of polypeptides. Since such modified polypeptides are not natural, 
they may be less susceptible to proteolytic enzymes generally present in cells. 

Introduction of amino acid analogues in polypeptides may produce modified polypeptides 
25 having a variety of side chains having highly active chemical functional groups. The 
reactivity of the various types of the functional groups introduced can be exploited to 
control protein structure and function. For example, polypeptides or proteins may be 
produced that have undergone site-specific phosphorylation, methylation or addition of 
sugar chains. It may be possible to produce modified polypeptides as derivatives 
30 analogous to specified proteins by the introduction of amino acid analogues having 
functional groups to form crosslinks so that cellular components which interact with the 
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specified proteins in the cells can be detected. Modified polypeptides with incorporated 
fluorescent amino acid residues are useful to trace metabolic pathways in organisms or to 
elucidate mechanisms of biological actions. It is possible to produce modified 
polypeptides having amino acid analogues which differ in acid dissociation constant from 
5 natural amino acids, so as to control properties of the polypeptides that depend on the 
acidity in aqueous solutions. 

It is possible to introduce amino acid analogues into polypeptides that will self-assemble 
so as to mimic viruses (e.g., coat proteins), muscle fibers (e.g., actin and myosin) or 
10 chromatin (e.g., histones) so as to create supra-molecular structures having specified 
functions. Additionally, the supra-molecular structures can be further modified in a 
biological system to create other supra-molecular structures having specified functions. 

It may be possible to add amino acid analogues according to the methods of the invention 
15 to artificial feeds for silk worms that can synthesize silk with the amino acid analogues. 
Further, it may be possible to produce protein fibers with optical properties from 
modified polypeptides into which amino acid analogues have been incorporated. In this 
regard, modified polypeptides with amino acid analogues having functional groups to 
form crosslinkages can produce supra-molecular structures with silk as supporting 
20 construction. Crosslinkages of the modified polypeptides can then produce new 
proteinaceous structures. Into the structures thus produced, non-natural fluorescent amino 
acids can be introduced, e.g. to make biochips for photoenergy transduction. 

ADVANTAGES OF THE INVENTION 

25 

The invention introduces a unique strategy that can be widely applied to incorporate 
amino acid analogues to substitute for any of the selected natural amino acid residues in 
polypeptides of interest. A greater range of amino acid analogues can be employed for 
protein synthesis. In addition, modified polypeptides produced using the methods of this 
30 invention can be produced in higher yields and with high levels of replacement of natural 
amino acids. 
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The method of this invention changes the building blocks of protein synthesis, leaving the 
"blueprint" encoding the proteins unchanged. The invention, therefore, permits a rapid 
and predictable approach to protein design and produces modified polypeptides with 
5 significantly increased yields and expansion of amino acid analogues that can serve as 
substrates for polypeptide synthesis. 

This method of this invention is generally applicable to a large range of proteins, 
enzymes, and peptides, and is not limited by the size or structure of the proteins or 

10 polypeptides. Incorporation of amino acid analogues with different functionalities, such 
as double bonds, can be utilized for further chemical derivatization of the polypeptide of 
interest. Furthermore, the feasibility of incorporating amino acid analogues using in vivo 
methods should allow the manipulation of enzymes, signaling molecules, protein ligands, 
and may prove to be of broad utility in the engineering of more versatile biological 

15 assemblies. 

The following examples are presented to illustrate the present invention and to assist one of 
ordinary skill in making and using the same. The examples are not intended in any way to 
otherwise limit the scope of the invention. 

20 

EXAMPLE I 

This example demonstrates the selectivity of methionyl t-RNA synthetase for methionine 
analogues and the efficient incorporation of unsaturated methionine analogues into 
25 proteins in vivo. 

Synthesis of amino acid analogues 

Each of the analogues 2 -7 and 11 (Figure 2) was prepared by alkylation of diethyl 
30 acetamidomalonate with the appropriate alkyl tosylate followed by decarboxylation and 
deprotection of the amine function. This section provides information on general 
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synthetic procedures and a detailed protocol for preparation of 2. Similar methods were 
used to prepare 3-7 and 11. Analogues 8, 9, 12, and 13 are available commercially 
(Sigma- Aldrich, St. Louis, Mo). Analogue 10 was prepared as described by Blackwell et 
al (H. E. Blackwell, R. H. Grubbs, Angew. Chem. 1998, 7/0, 3469-3472; Angew. Chem. 
5 Int. Ed 1998, 57, 3281-3284.). 

General Procedures. 

Glassware was dried at 150°C and cooled under nitrogen prior to use. Tetrahydrofuran 
10 (THF) was freshly distilled from Na^enzophenone. N,N-Dimethylformamide (DMF) 
was distilled and stored over BaO. Pyridine (99.8%, anhydrous, Aldrich) and other 
reagents and solvents were used as received. ] H NMR spectra were recorded on Bruker 
AC 200 and AMX 500 spectrometers and i3 C NMR spectra were recorded on a Bruker 
DPX 300 spectrometer. Column chromatography was performed with silica gel 60, 230- 
15 400 mesh (EM Science); silica 60-F254 (Riedel-de Haen) was used for thin layer 
chromatography. 

DL-2-amino-5-hexenoic acid (2) 

20 (Drinkwater, D.J.; Smith, P.W.G. J. Chem, Soc. C 1971, 1305; Baldwin, J.E.; Hulme, 
C; Schofield, CJ. J. Chem. Res. (S) 1992, 173). 

3-Buten-l-ol 4-methylbenzene sulfonate. 

25 A solution of 3 g (42 mmol) 3-buten-l-ol in 10 mL dry pyridine was cooled in an ice 
bath. Tosyl chloride (7.9 g, 42 mmol), was added. After stirring for 3 h the mixture was 
poured into 30 mL of an ice/concentrated HC1 4/1 v/v solution, extracted with 60 mL 
diethyl ether and dried overnight in the freezer over MgSC>4. The mixture was filtered 
and the ether evaporated to yield 7.22 g (76%) of 3-buten-l-ol 4-methylbenzene sulfonate 

30 as a yellow oil. *H NMR (CDC1 3 ): 5 2.39-2.53 (m, 2H, J = 6.5 and 6.9 Hz, CH2- 
CH=CH 2 ; and s, 3H, CHs-Ar), 4.08 (t, J= 6.5 Hz, 2H, CH2OSO2), 5.09-5.15 (m, 2H, J z = 
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10.4, J E = 16.6, Jgem = 3.1 Hz, CH 2 -CH=CH2), 5.57-5.82 (m, 1H, J z = 10.4, J ^ 16.6, J 
= 6.9 Hz, CH 2 -CH=CH 2 ), 7.38 and 7.72 (d, 4H, J= 8.6 Hz, Ar). 



Acetylamino-3-butenyl-propanedioic acid diethyl ester. 

5 

Diethyl acetamidomalonate, 1.56 g (6.9 mmol), was dissolved at room temperature under 
N 2 in 10 mL dry THF. Potassium ter/-butoxide (0.80 g, 7 mmol), was added under 
vigorous stirring. The mixture was heated for 2 h at 60°C 3-Buten-l-ol 4- 
methylbenzenesulfonate (1.5 g, 6.9 mmol) was added, and the mixture was heated under 

10 reflux for 2 days. The THF was removed, the residue was quenched with 10 mL 1 M 
HC1, and the crude product was extracted with ethyl acetate (25 mL). The ethyl acetate 
solution was washed twice with 25 mL water, dried over MgSC>4, filtered and 
concentrated. The crude product was purified by column chromatography (eluent 
cyclohexane/ethyl acetate 2/1 v/v) to yield 0.82 g (44%) of acetylamino-3-butenyl- 

15 propanediol acid diethyl ester. ! H NMR (CDC1 3 ): 5 1.28 (t, 6H, J = 12 Hz, CH3-CH 2 ), 
1.78-2.0 (m, 2H, J= 8.3, 6.5 Hz, CH 2 =CH-CH2-CH 2 ), 2.08 (s, 3H, CONH-CH3), 2.45 (m, 
2H, J = 8.3 Hz, CH 2 =CH-CH 2 -CH2), 4.25 (q, 4H, J = 7.2 Hz, CH3-CH2), 4.90-5.09 (m, 
2H, J z = 10.4, J E = 16 A J gcm = 3.2 Hz, CHrCH-CHb), 5.61-5.90 (m, 1H, Jz= 10.4, J E = 
16.6, J= 6.5 Hz, CH 2 -CH=CH 2 ), 6.78 (s, 1H, CONH-CH3). 

20 

DL-2-amino-5-hexenoic acid. 

The diethyl ester obtained as described above was hydrolyzed to the dicarboxylate by 
heating under reflux for 4 h in 25 mL 10 w% NaOH. The solution was neutralized with 6 

25 M HC1 and the solvent was evaporated. The diacid was extracted with 25 mL of 
methanol. Following solvent evaporation, 20 mL 1M HC1 was added and the solution 
was refluxed for 3 h. The solvent was evaporated and the product was taken up in 10 mL 
methanol. Propylene oxide (5 mL) was added and the mixture was stirred overnight at 
room temperature. The precipitate was filtered and dried, yielding DL- 2-amino-5- 

30 hexenoic acid (0.47 g, 63%). The product was recrystallized from EtOH/H 2 0 2/1 v/v 
(0.28 g, 60%). The ! H NMR data were in agreement with those of reference 16 
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(Hatanaka, S.-L; Furukawa, J.; Aoki, T.; Akatsuka, H.; Nagasawa, E. Mycoscience, 
1994, 35, 391). *H NMR (D 2 0): 5 1.78-2.0 (m, 2H, J= 6.4, 6.6 Hz, CH 2 =CH-CH2-CH 2 ), 
2.08-2.20 (m, 2H, J = 6.1, 6.4 Hz, CH 2 =CH-CH 2 -CH2), 3.75 (t, 1H, J = 6.1 Hz, H 2 N- 
CH-COOH), 4.90-5.12 (m, 2H, J z = 10.5, J E = 16.7, J gem = 3.3 Hz, CH 2 -CH=CH2), 5.61- 
5 5.90 (m, 1H, J r= 10.5, J E = 16.7, J = 6.6 Hz, CH 2 -CH=CH 2 ). 13 C NMR (D 2 0): 5 28.9 
(CH 2 =CH-CH 2 -CH 2 ), 29.9 (CH 2 =CH-CH 2 -CH 2 ), 54.4 (H 2 N-CH-COOH), 116.3 (CH 2 - 
CH=CH 2 ), 137.3 (CH 2 -CH=CH 2 ), 174.8 (COOH). 

Determination of translational activity 

10 

Buffers and media were prepared according to standard protocols (Sambrook, J.; Fritsch, 
E.F.; Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd Ed Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y. 1989; Ausubel, F.M.; Brent, K.; 
Kingston, K.E.; Moore, D.D.; Scidman, J.G.; Smith, J.A.; Struhl, K. Current Protocols 
15 in Molecular Biology, John Wiley & Sons, NY 1995). The E. coli methionine auxotroph 
CAG18491 (X, rph-l metE3079.:Tn!0) (obtained from the Yale E. coli Genetic Stock 
Center), was transformed with plasmids pREP4 and pQE15 (Qiagen, Valencia, CA), to 
obtain the expression host CAG18491/pQE15/pREP4. 

20 Protein expression (5 mL scale). 

M9AA medium (50 mL) supplemented with 1 mM MgS04, 0.2 wt% glucose, 1 mg/L 
thiamine chloride and the antibiotics ampicillin (200 mg/L) and kanamycin (25 mg/L) 
was inoculated with 2 mL of an overnight culture of CAG18491/pQE15/pREP4. When 

25 the turbidity of the culture reached an optical density at 600 nm (OD600) of 0.8, a medium 
shift was performed. The cells were sedimented for 10 min at 3030 g at 4°C, the 
supernatant was removed, and the cell pellet was washed twice with 20 mL of M9 
medium. Cells were resuspended in 50 mL of the M9AA medium described above, 
without methionine. Test tubes containing 5 mL aliquots of the resulting culture were 

30 prepared, and were supplemented with 200 ^L 1 mg/mL (0.27 mM) L-methionine (1) 
(positive control), DL-2-amino-5-hexenoic acid (2) (0.31 mM), DL- 
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homopropargylglycine(3) (0.3 1 mM), cis- or trans-DL-2-amino-4-hexenoic acid (4 or 5) 
(031 mM), DL-6,6,64rifluoro-2-amino hexanoic acid (6) (0.22 mM), DL-2- 
aminoheptanoic acid (7) (0.28 mM), L-norvaline (8) (0.34 mM) or L-norleucine (9) (0.31 
mM), respectively. A culture lacking methionine (or any analogue) served as the 
5 negative control. Protein expression was induced by addition of isopropyl-P-D- 
thiogalactopyranoside (IPTG) to a final concentration of 0.4 mM. Samples were taken 
every hour for 4 h, the OD600 was measured, and the samples were sedimented. After the 
supernatant was decanted, the cell pellets were resuspended in 20 \iL distilled H2O. 
Protein expression was monitored by SDS polyacrylamide gel electrophoresis (12% 
10 acrylamide running gel, 12 mA, 14 h), using a normalized OD600 of 0.2 per sample. 

Protein expression (1 L scale) 

Similar procedures were used for preparation and isolation of mDHFR from media 
1 5 supplemented with 1, 2 or 3. The example presented is for medium supplemented with 3. 
M9AA medium (100 mL) supplemented with 1 mM MgS0 4 0.2w% glucose, 1 mg/L 
thiamine chloride and the antibiotics ampicillin (200 mg/L) and kanamycin (25 mg/L) 
was inoculated with E. coli strain CAG18491/pQE15/pREP4 and grown overnight at 
37°C. This culture was used to inoculate 900 mL M9AA medium supplemented as 
20 described. The cells were grown to an OD 60 o of 0.94 and the medium shift was 
performed as described for the small scale experiments, followed by addition of 40 mL of 
1 mg/mL DL-homopropargylglycine (3). IPTG (0.4 mM) was added, and samples were 
taken at 1 hour intervals. OD600 was measured, the samples were sedimented and 
decanted, and the cell pellets were resuspended in 20 ^L distilled H 2 0. Protein 
25 expression was monitored by SDS polyacrylamide gel electrophoresis (12% acrylamide 
running gel, 12 mA, 15 h). 

Protein purification 

30 Approximately 4.5 h after induction, cells were sedimented (9,800 g, 10 min, 4°C) and 
the supernatant was removed. The pellet was placed in the freezer overnight. The cells 
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were thawed for 30 min at 37°C, 30 mL of buffer (6 M guanidine-HCl, 0.1 M NaH 2 P0 4 , 
0.01 M Tris, pH 8) was added and the mixture was shaken at room temperature for 1 h. 
The cell debris was sedimented (15,300 g, 20 min, 4°C) and the supernatant was 
subjected to immobilized metal affinity chromatography (Ni-NTA resin) according to the 
5 procedure described by Qiagen (The Qiagen Expressionist, Purification Procedure 7, 
1992, 45). The supernatant was loaded on 10 mL of resin which was then washed with 
50 mL of guanidine buffer followed by 25 mL of urea buffer (8 M urea, 0.1 M NaHbPC^ 
and 0.01 M Tris, pH 8). Similar urea buffers were used for three successive 25 mL 
washes at pH values of 6.3, 5.9 and 4.5, respectively. Target protein was obtained in 

10 washes at pH 5.9 and 4.5. These washes were combined and dialyzed (Spectra/Por 
membrane 1, MWCO=6-8 kDa) against running distilled water for 4 days, followed by 
batchwise dialysis against doubly distilled water for one day. The dialysate was 
lyophilized to yield 70 mg of modified mDHFR (mDHFR-Y). A similar procedure using 
medium supplemented with 2 yielded 8 mg of mDHFR-E. A control experiment in 2xYT 

15 medium afforded 60 mg of mDHFR. Amino acid analyses, electrospray mass 
spectrometry and N-terminal protein sequencing was performed on the mDHFR isolated. 

Enzyme purification and activation assays 

20 The folly active, truncated form of methionyl tRNA synthetase (MetRS) was purified 
from overnight cultures of JM101 cells carrying the plasmid pGG3. (The plasmid, which 
encodes the tryptic fragment of MetRS, Ghosh, G.; Brunie, S.; Schulman, L.H. J. Biol 
Chem. 1991, 266, 17136-17141). The enzyme was purified by size exclusion 
chromotography as previously described (Mellot, P.; Mechulam, Y.; LeCorre, D.; 

25 Blanquet, S.; Fayat, G. J. Mol Biol 1989, 208, 429). Activation of methionine 
analogues by MetRS was assayed via the amino-acid-dependent ATP-PPj exchange 
reaction, also as previously described (Mellot, P.; Mechulam, Y.; LeCorre, D.; Blanquet, 
S.; Fayat, G. J. Mol Biol 1989, 208, 429; Blanquet, S.; Fayat, G.; Waller, J.-P. Eur. J. 
Biochem. 1974, 44, 343; Ghosh, G.; Pelka, H.; Schulman, L. H. Biochemistry 1990, 29, 

30 2220). The assay, which measures the P-radiolabeled ATP formed by the enzyme- 
catalyzed exchange of 32 P-pyrophosphate (PPj) into ATP, was conducted in 150 \x\ of 
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reaction buffer (pH 7.6, 20 mM imidazole, 0.1 mM EDTA, 10 mM (3-mercaptoethanol, 7 
mM MgCl 2 , 2 mM ATP, 0.1 mg/ml BSA, and 2 mM PPj (in the form of sodium 
pyrophosphate with a specific activity of approximately 0.18 TBq/mole)). Assays to 
determine if the methionine analogues 2 - 13 are recognized by MetRS were conducted in 
5 solutions 75 nM in enzyme and 5 mM in the L-isomer of the analogue with a reaction 
time of 20 minutes. Kinetic parameters for analogue 5 were obtained with an enzyme 
concentration of 75 nM and analogue concentrations of 100 |iM to 10 mM. Parameters 
for methionine were obtained by using concentrations ranging from 10 ^iM to 1 mM. K m 
values for methionine matched those previously reported (Ghosh, G.; Pelka, H.; 

10 Schulman, L. H.; Brunie, S. Biochemistry 1991, 30, 9569), though the measured kcat was 
somewhat lower than the literature value. Aliquots of 20 \i\ were removed from the 
reaction mixture at various time points and were quenched in 0.5 ml of a solution 
comprising 200 mM PPj, 7% w/v HCIO4, and 3% w/v activated charcoal. The charcoal 
was rinsed twice with 0.5 mL of a 10 mM PPj, 0.5% HC10 4 solution and was then 

15 resuspended in 0.5 mL of this solution and counted via liquid scintillation methods. 
Kinetic constants were calculated by nonlinear regression analysis. 

Computation 

20 Single-point energy ab initio calculations (Hartree-Fock model, 6-3 1G* basis set) (Hehre, 
W.J.; Ditchfield, R.; Pople, J.A. J. Chem. Phys. 1972, 56, 2257; Hariharan, P.C.; Pople, 
J.A. Chem. Phys. Lett. 1972, 66, 217; Francl, M.M. et al. J. Chem. Phys. 1982, 77, 
3654) were performed for methionine and for analogues 2, 3 and 5 with fully extended 
side chains. Electron density maps are shown as surfaces of electron density 0.08 

25 electrons/au 3 . Isopotential plots are represented as surfaces where the energy of 
interaction between the amino acid and a point positive charge is equal to -10 kcal/mole. 
Calculations were performed by using the program MacSpartan (Wavefunction, Inc., 
Irvine, CA, USA). 
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Results and Discussion 



Methionine Analogues 

5 Methionine analogues 2-13 were investigated with respect to their capacity to support 
protein synthesis in E. coli cells depleted of methionine. Norvaline (8) and norleucine (9), 
allylgycine (12), and propargylglycine (13) are commercially available. Analogues 2-7 
and 11 were prepared by alkylation of diethyl acetamidomalonate with the corresponding 
tosylates via standard procedures, and the remaining analogues were prepared as 

10 described supra. In the cases of the cis- and /rara-crotylglycines (4 and 5) the tosylates 
were prepared in situ, and because of fast exchange of the tosyl group with chloride ion, 
mixtures of the chloride and the tosylate were obtained. Hydrolysis of the malonate and 
conversion to the amino acid had to be performed under mild acidic conditions for 
analogues 2, 4 and 5; treatment with 6 N HC1, or reflux in 1 N HC1 for more than 5 h led 

15 to HC1 addition to the double bond. In all cases the analogues were obtained as 
racemates and were used as such. 

Protein Expression 

20 E. coli strain CAG18491/pQE15/pREP4, which produces the test protein mDHFR upon 
induction with IPTG, was used as the expression host. The parent strain CAG18491 is 
dependent on methionine for growth, owing to insertion of transposon TnlO into the metE 
gene, which is essential for the final step in the endogenous synthesis of methionine. 
Cultures were grown in minimal medium supplemented with methionine until a cell 

25 density corresponding to OD600 0.8-1.0 was reached. Cells were sedimented, washed and 
resuspended in minimal medium without methionine. Aliquots of the culture were then 
supplemented with one of the analogues 2 - 13. Protein synthesis was induced with IPTG 
and cell growth and protein expression were followed over a 4 h period. Expression 
results are presented in Figure 3, and show clearly that analogues 2 and 3 exhibit 

30 translational activity sufficient to allow protein synthesis in the absence of methionine. 
Analogues 4 - 8 and 12-13 are not active in the assay reported here, while the known 
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translational activity of norleucine (9) was confirmed. CAG18491/pQE15/pREP4cultures 
did not grow in minimal media in which methionine was replaced by 2 and 3, at the time 
of inoculation. 

5 Analysis af Protein Structure 

The extent of replacement of methionine by analogues 2 and 3 was determined by an 
amino acid analysis, N-terminal sequencing, and (for 2) ! H nuclear magnetic resonance 
spectroscopy (Table 1). Proteins containing 2 and 3 were designated mDHFR-E (alkene) 
1 0 and mDHFR- Y (alkyne), respectively. 



Table 1. Protein Yield and Extent of Methionine Replacement 



Protein 


Yield (mg) a 


Replacement (%) 
Amino Acid Analysis 


Sequencing 


'HNMR 


mDHFR-E 


8 


86 


92 


77 


mDHFR-Y 


70 


100 


88 


Not 










determined 



3 Yield of purified protein obtained from 1 L of CAG18491/pQE15/pREP4culture grown 
15 to OD 6 oo = 0.94 prior to induction by addition of IPTG. The yield of mDHFR obtained 
from control cultures supplemented with methionine was approximately 70 mg/L. 

mDHFR-E. 

20 Amino acid analysis of mDHFR-E showed a methionine content of 0.5 mol% vs. the 
value of 3.8 mol% expected for mDHFR. Although 2 appears to be unstable under the 
conditions used to hydrolyze the protein for amino acid analysis, assumption that the 
decrement in methionine content is due to replacement by 2 affords an estimate of 86% 
substitution by the analogue. This estimate is consistent with the results of N-terminal 

25 sequencing of mDHFR-E (Figure 4), which indicates 92% occupancy of the initiator site 
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by 2. In the chromatograms shown in Figure 4, the signal due to methionine appears at a 
retention time of 12.3 min, while that from 2 elutes at 14.3 min. The retention time of the 
signal arising from 2 was verified by analysis of an authentic sample of the analogue. 
Retention of the N-terminal residue in mDHFR was expected on the basis of the known 
5 correlation between the extent of methionine excision from E. coli proteins and the 
identity of the penultimate amino acid residue (Hirel, P.H.; Schmitter, J.M.; Dessen, P.; 
Fayat, G.; Blanquet, S. Proc. Natl Acad. Set USA 1989, 86, 8247). Finally, direct 
evidence for incorporation of the alkene function of 2 was obtained from *H NMR 
spectroscopy. The vinyl CH resonance of 2 appears at a chemical shift of 5.7 ppm in the 
10 spectrum of mDHFR-E, and can be integrated to yield an estimate of 77% replacement of 
methionine by the unsaturated analogue. A yield of 8 mg of mDHFR-E was obtained 
from a 1 L culture of CAG18491/pQE15/pREP4grown in M9AA medium supplemented 
with 2, compared with 70 mg obtained from a similar experiment in medium 
supplemented with methionine. 

15 

mDHFR-Y. 

Methionine could not be detected via amino acid analysis of mDHFR-Y, suggesting 
quantitative replacement of methionine by the alkyne analogue 3. N-terminal sequencing 

20 (Figure 4) indicated 88% occupancy of the initiator site by 3. ] U NMR analysis of 
mDHFR-Y was consistent with near-quantitative replacement of methionine, as the 
thiomethyl resonance at 2.05 ppm - which is prominent in the spectrum of mDHFR - 
could not be detected. New signals at 2.2-2.3 ppm - which are not observed in the 
spectrum of mDHFR and which correspond to signals due to the j3- and s-protons of 3 - 

25 appeared in the spectrum of mDHFR-Y, but were not integrated carefully owing to 
overlap with neighboring resonances. The yield of mDHFR-Y obtained from M9AA 
medium supplemented with 3 was essentially identical to that of mDHFR isolated from 
media supplemented with methionine. 

30 
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Enzyme Assays 

The relative rates of activation of methionine and methionine analogues 2 - 13 by MetRS 
were estimated by the ATP-PPj exchange assay. The results shown in Figure 5 illustrate 
5 the amount of PPi exchanged at a reaction time of 20 minutes under standard assay 
conditions (see Experimental Section). Methionine (1) is activated most efficiently by 
the enzyme, causing exchange of 9 nmoles of PPj over the time course of the reaction. 
Analogues 2 and 3 cause exchange of PPj at rates similar to that of norleucine (9), while 
the remaining analogues 4, 6 - 8, and 12-13 cause exchange of PPj at levels no higher 

10 than background (Figure 5, lanel4). The background (lane 14) is given for a reaction 
mixture lacking both the enzyme and the amino acid. Although analogues 5 and 11 effect 
very slow exchange of PPi, the activation rate is apparently too low to support protein 
synthesis at a level that is detectable in the in vivo assays. Kinetic parameters were 
determined for methionine and 5 as outlined in the Experimental Section. Comparison of 

15 the kcat/K m values obtained for methionine (0.54 s" 1 ^iM' 1 ) and 5 (1.1 x 10" 4 s" 1 \jM' 1 ) show 
that 5 is activated 4700-fold less efficiently than methionine by MetRS. Comparison of 
the kcat/K m values obtained for methionine (0.54 s" 1 fiM" 1 ) and 11 (3.9 x 10" 5 s" 1 fiM" 1 ) 
show that 11 is activated 13825-fold less efficiently than methionine by MetRS. 

20 Discussion 

A bacterial host strain (designated CAG18491/pQE15/pREP4) suitable for testing the 
translational activity of methionine analogues 2-8 and 10-13 was prepared by 
transformation of £ coli strain CAG 18491, a methionine auxotroph, with the repressor 

25 plasmid pREP4 and the expression plasmid pQE15. pQE15 encodes mouse dihydrofolate 
reductase (mDHFR) under control of a bacteriophage T5 promoter, and appends to 
mDHFR an N-terminal hexahistidine sequence that facilitates purification of the protein 
by immobilized metal affinity chromatography. mDHFR contains eight methionine 
residues, each a potential site for substitution by analogues 2-8 and 10-13. The 

30 translational activity of each analogue was assayed on the basis of its capacity to support 
synthesis of mDHFR in cultures of CAG 18491 /pQE15/pREP4 that had been depleted of 
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methionine. In those instances in which the test protein was detected by gel 
electrophoresis (i.e., for 2 and 3), the modified mDHFR was purified and analyzed to 
determine the extent of methionine replacement by the analogue. 



5 The results of the in vivo assays illustrated in Figure 3 show clearly that homoallylglycine 
(2) and homopropargyl glycine (3) serve effectively as methionine surrogates in bacterial 
protein synthesis. In contrast, analogues 4- 8 and 10-13 do not support measurable levels 
of protein synthesis in bacterial cultures depleted of methionine. It is highly unlikely that 
recognition by the elongation factors of the ribosome or transport into the cell are the 

10 limiting factors for incorporation of these analogues. The ribosome is remarkably 
permissive toward amino acid analogues with widely varying chemical functionality, as 
has been demonstrated by the numerous analogues incorporated into proteins in in vitro 
translation experiments (Cornish, V.W.; Mendel, D.; Schultz, P.G. Angew. Chem. Int. 
Ed. Engl 1995, 34, 621; Robertson, S.A.; Ellman, J. A.; Schultz, P.G. J. Am. Chem. Soc. 

15 1991, 775, 2722; Noren, C.J.; Anthony-Cahill, S.I; Griffith, M.C.; Schultz, P.G. 
Science 1989, 244, 182; Bain, J.D.; Glabe, C.G.; Dix, T.A.; Chamberlin, A.R. J. Am. 
Chem. Soc. 1989, 777, 8013; Bain, J.D. et al. Nature 1992, 356, 537; Gallivan, J.P.; 
Lester, H.A.; Dougherty, D.A. Chem. Biol. 1997, 4, 740; Turcatti, et al. J. Biol Chem. 
1996, 277, 19991; Nowak, M.W. et al. Science, 1995, 268, 439; Saks, M.E. et al. J. 

20 Biol Chem. 1996, 277, 23169; Hohsaka, T. et al. J. Am. Chem. Soc. 1999, 727, 34). 

Transport of several analogues into the cell is indicated by a number of literature reports. 

Analogue 4 is an antagonist for methionine, inhibiting the growth of E. coli cells 

(Skinner, C. G.; Edelson, J.; Shive, W. J. Am. Chem. Soc. 1961, 83, 2281); 5 has been 
25 incorporated into proteins in E. coli cells with appropriately engineered MetRS activity ; 

and 8 replaces leucine in human hemoglobin expressed in E. coli (Apostol, I.; Levine, J.; 

Lippincott, J.; Leach, J.; Hess, E.; Glascock, C. B.; Weickert, M. J.; Blackmore, R. J. 

Biol. Chem. 1997, 272, 28980). Although there is no similar evidence reported for 

analogues 6 and 7, the fact that trifluoromethionine and ethionine are incorporated into 
30 proteins expressed in E. coli (Hendrickson, W.A.; Horton, J.R.; Lemaster, D.M. EMBO 

J. 1990, 9, 1665; Boles, J.O. et al Nature Struct. Biol 1994, 7, 283; Cowie, D.B.; 
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Cohen, G.N.; Bolton, E.T.; de Robichon-Szulmajster, H. Biochim. Biophys. Acta 1959, 
34, 39; Duewel, H.; Daub, E.; Robinson, R.; Honek, J.F. Biochemistry 1997, 36, 3404; 
Budisa, N.; Steipe, B.; Demange, P.; Eckerskorn, C; Kellerman, J.; Huber, R. Eur. J. 
Biochem. 1995, 230, 788) suggests that neither the trifluoromethyl group nor the longer 
5 side chain will inhibit transport of analogues 6 and 7 into E. coli cells. 

The results of the in vitro enzyme assays shown in Figure 5 are consistent with the in vivo 
results, as the analogues that support the highest rates of PPj exchange also support 
protein synthesis in the absence of methionine. Although the in vitro results indicate that 

10 5 and 11 are recognized by MetRS, comparison of the kc at /K m values of methionine and 5 
and 11, demonstrate that 5 is activated 4700-fold and 11 13825-fold less efficiently than 
methionine; thus it is not surprising that neither 5 or 11 support measurable protein 
synthesis in the in vivo experiments. Consideration of the in vivo and in vitro results, 
along with the reports cited earlier, suggests that transport is not limiting and that 

1 5 analogue incorporation is controlled by the MetRS. 

Although the crystal structure of an active tryptic fragment of the E. coli MetRS 
(complexed with ATP) has been reported (Brunie, S.; Zelwer, C; Risler, J.L. J. Mol 
Biol 1990, 216, 411; Mechulam, Y.; Schmitt, E.; Maveyraud, L.; Zelwer, C; Nureki, O.; 

20 Yokoyama, S.; Konno, M.; Blanquet, S. J. Mol Biol 1999, 294, 1287 r -1297), the 
corresponding structure with bound methionine is not yet available. Inferences 
concerning the mechanism of methionine (or analogue) recognition by MetRS have 
heretofore been made indirectly, on the basis of sequence comparison and site-directed 
mutagenesis (Ghosh, G.; Pelka, H.; Schulman, L. H.; Brunie, S. Biochemistry 1991, 30, 

25 9569; Fourmy, D.; Mechulam, Y.; Brunie, S.; Blanquet, S.; Fayat, G. FEBS Lett 1991, 
292, 259; Kim, H.Y.; Ghosh, G.; Schulman, L.H.; Brunie, S.; Jakubowski, H. Proc. 
Natl Acad. Sci. USA 1993, 90, 11553). 

Figure 6 compares the equipotential surfaces calculated for methionine and for analogues 
30 2, 3 and 5. That 2 might serve as a substrate for the methionyl-tRNA synthetase is not 
surprising, given the similar geometries accessible to 1 and 2, the availability of n- 
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electrons near the side-chain terminus of 2, and the known translational activity of 
norcleucine (9), the saturated analogue of 2. The high translational activity observed for 
3, (i.e. near-quantitative replacement of methionine without loss of protein yield), was not 
anticipated, since the colinearity of side-chain carbons 4-6 imposes on 3 a geometry 
5 substantially different from that of methionine. However, the electron density associated 
with the triple bond of 3 is positioned similarly to that of the thioether of the natural 
substrate, despite the differences in side-chain geometry. Furthermore, given the 
important roles assigned to residues Phel97 and Trp305 in the E. coli methionyl-tRNA 
synthetase (Ghosh, G.; Pelka, H.; Schulman, L. H.; Brunie, S. Biochemistry 1991, 30, 

10 9569; Fourmy, D.; Mechulam, Y.; Brunie, S.; Blanquet, S.; Fayat, G. FEBS Lett. 1991, 
292, 259; Kim, H.Y.; Ghosh, G.; Schulman, L.H.; Brunie, S.; Jakubowski, H. Proc. 
Natl Acad Sci. USA 1993, 90, 11553), alkynyl C-H/tt contacts (Steiner, T.; Starikov, 
EJB.; Amado, A.M.; Teixeira-Dias, J.J.C. J. Chem. Soc. Perk. Trans. 2, 1995, 7, 1321) 
and the polarizability of the unsaturated side chain may also play significant roles in 

15 recognition of 3 by the enzyme. Figure 6 also compares the geometries of 1 and 5, the 
latter an analogue neither recognized efficiently by the MetRS in vitro nor translationally 
active in vivo. Although the geometries of 1 and 5 appear similar in the representation 
shown, the fixed planarity of the C4-C5 bond may preclude the side-chain conformation 
required for efficient recognition of 5 by MetRS. Appropriate engineering of the MetRS 

20 activities of E. coli imparts translational activity to 5. 

In conclusion, a set of twelve methionine analogues was assayed for translational activity 
in Escherichia coli. Norvaline and norleucine, which are commercially available, were 
assayed along with homoailylglycine (2), homopropargyl glycine (3), ris-crotylglycine 

25 (4), rnmy-crotylglycine (5), 6,6,6-trifluoro-2-aminohexanoic acid (6) and 2- 
aminoheptanoic acid (7) and 2-butynylglycine (11), each of which was prepared by 
alkylation of diethyl acetamidomalonate with the appropriate tosylate, followed by 
hydrolysis. The other analogues were commercially available or prepared as described 
supra. The E. coli methionine auxotroph CAG 18491, transformed with plasmids pREP4 

30 and pQE15, was used as the expression host, and translational activity was assayed by 
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determination of the capacity of the analogue to support synthesis of the test protein 
dihydrofolate reductase (mDHFR) in the absence of added methionine. 

The importance of amino acid side chain length was illustrated by the fact that neither 
5 norvaline (8) nor 7 showed translational activity, in contrast to norleucine (9), which does 
support protein synthesis under the assay conditions. The internal alkene functions of 4 
and 5 prevented incorporation of these analogues into test protein, and the fluorinated 
analogue 6 and 10-13 yielded no evidence of translational activity. The terminally- 
unsaturated compounds 2 and 3, however, proved to be excellent methionine surrogates: 

10 *H NMR spectroscopy, amino acid analysis and N-terminal sequencing indicated ca 85% 
substitution of methionine by 2, while 3 showed 90-100% replacement. Both analogues 
also function efficiently in the initiation step of protein synthesis, as shown by their near- 
quantitative occupancy of the N-terminal amino acid site in mDHFR. Enzyme kinetics 
assays were conducted to determine the rate of activation of each of the methionine 

15 analogues by methionyl tRNA synthetase (MetRS); results of the in vitro assays 
corroborate the in vivo incorporation results, suggesting that success or failure of 
analogue incorporation in vivo is controlled by MetRS, 

EXAMPLE II 

20 

This example demonstrates the expansion of the scope of methionine analogues for 
incorporation into proteins in vivo by altering the methionyl-tRNA synthetase activity of 
a bacterial expression host. 

25 The relative rates of activation of methionine and methionine analogues 2-13 (Figure 2) 
by MetRS were characterized in vitro by the ATP-PPj exchange assay. The fully active, 
truncated form of MetRS was purified from overnight cultures of JM101 cells carrying 
the plasmid pGG3. The enzyme was purified by size exclusion chromotography as 
previously described (P. Mellot, Y. Mechulam, D. LeCorre, S. Blanquet, G. Fayat, J. 

30 Mol Biol 1989, 208, 429-443). Activation of methionine analogues by MetRS was 
assayed at 25°C via the amino acid-dependent ATP-PPj exchange reaction, also as 
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described in Example I (G. Ghosh, H. Pelka, L. H. Schulman, Biochemistry 1990, 29, 
2220-2225). Assays to determine if the methionine analogues 2 - 13 were recognized by 
. MetRS were conducted in solutions 75 nM in enzyme and 5 mM in the L-isomer of the 
analogue with a reaction time of 20 minutes. Kinetic parameters for analogue 5 were 
5 obtained with an enzyme concentration of 50 nM and analogue concentrations of 100 \xM 
to 10 mM. Kinetic parameters for analogue 11 were determined using an enzyme 
concentration of 50 nM and analogue concentrations ranging from 750 ^M to 20 mM. 
Kinetic parameters for analogues 4, 7, 8, and 12 were obtained with an enzyme 
concentration of 50 or 75 nM and analogue concentrations ranging from 5 to 70 mM. 

10 Parameters for methionine were obtained by using concentrations ranging from 10 ^iM to 
1 mM. K m values for methionine were similar to those previously reported (24 ± 2 jiM), 
though the measured k cai was somewhat lower than the literature value (13.5 s" 1 ) (H. Y. 
Kim, G. Ghosh, L. H. Schulman, S. Brunie, H. Jakubowski, Proc. Natl Acad. Set USA 
1993, 90, 11553-11557). Kinetic constants were calculated by nonlinear regression 

15 analysis. 

Figure 5 demonstrates that analogues 2 and 3 are activated by MetRS, as anticipated on 
the basis of the in vivo experiments (as described in Example I, infra; J. C. M. van Hest, 
D. A. Tirrell, FEES Lett. 1998, 428, 68-70), although they cause exchange of PPi at rates 
20 several-fold lower than methionine. Analogue 4 does not cause measurable exchange of 
PPi by MetRS in vitro, which was expected since neither 4 nor 5 were indicated to be 
translationally active in vivo. 

Analogues 5 and 11, however, were activated by MetRS, causing slow exchange of PPi 
25 under the assay conditions used herein. Table 2 shows the kc at /K m values obtained for 
methionine, 2, 3, 5, 9, and 11. Given that k cei t/K m for 5 is 4700-fold and that for 11 is 
13825-fold lower than that for methionine (as described in Example I, infra), it is not 
surprising that neither 5 or 11 support measurable protein synthesis within the time frame 
of the in vivo experiments. 

30 
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These results suggest that increasing the MetRS activity of the expression host might 
allow efficient protein synthesis in cultures supplemented with 5 or 11. This strategy was 
not employed previously for incorporating amino acid analogues into proteins in vivo, but 
reports of in vivo misacylation of tRNA substrates by overexpressed aminoacyl-tRNA 
5 synthetase supported the viability of the approach (S. Li, N. V. Kumar, U. Varshney, U. 
L. RajBhandary, J. Biol. Chem. 1996, 271, 1022-1028; J. M. Sherman, M. J. Rogers, D. 
Soil, Nuc. Acids. Res. 1992, 20, 2847-2852; U. Varshney, U. L. RajBhandary, J. 
Bacteriol. 1992, 174, 7819-7826; R. Swanson, P. Hoben, M. Sumner-Smith, H. Uemura, 
L. Watson, D. Soli, Science 1988, 242, 1548-1551). 

10 

Table 2. k^/Km values obtained for methionine, 2, 3, 5, 9, and 1 1 

Analogue K m (uM) k ca t(s'') k C at/K m (s"VM"') Protein Yield, mg/L 

15 



25 



1 


24.3 ± 2 


13.3 ±0.2 


5.47x10-' 


35 


3 


2415 ± 170 


2.60 ±0.3 


1.08xl0- 3 


35 


9 


4120 ±900 


2.15 ±0.6 


5.22x1 0" 4 


20 


2 


4555 ± 200 


1.35 ±0.1 


2.96x1 0" 4 


10 


5 


15,675 ±250 


1.82 ±0.6 


\.\6x\Q- 4 


0 


11 


38,650 ± 2000 


1.51 ±0.5 


3.91xlO- 5 





3 0 Generation of host-vector system 

A bacterial host capable of overexpressing MetRS was produced by transforming E. coli 
strains B834(DE3) (Novagen, Inc., Madison, WI, USA), a methionine auxotroph, with 
repressor plasmid pREP4 and expression plasmid pQE15-MRS (Figure 19) (SEQ ID 
35 NO.: 1). A gene encoding a mutant MetRS was removed from plasmid pBSM547W305F 
(D. Fourmy, Y. Mechulam, S. Brunie, S. Blanquet, G. Fayat, FEBS Lett. 1991, 292, 259- 
263) by treatment with restriction enzymes Sac I and Kpn L The Sac I/Kpn I fragment 
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(2450 bp) was ligated into the cloning vector pUC19-Nhelink, which was constructed to 
permit the cohesive ends of the mutant MetRS gene to be changed to Nhe L The MetRS 
gene with Nhe I cohesive ends was then ligated into the unique Nhe 1 site of the plasmid 
pQE15 (Qiagen, Inc., Santa Clarita, CA, USA) to yield plasmid pQE15-W305F (Figure 
5 20) (SEQ ID NO.: 2). Transformation of pQE15-W305F (SEQ ID NO.: 2) into a recA 
positive cell strain resulted in genetic recombination of the mutant MetRS gene with the 
chromosomal copy of the wild-type MetRS gene, yielding plasmid pQE15-MRS (SEQ ID 
NO.: 1). 

10 Expression plasmid pQE15-MRS (SEQ ID NO.: 1) and repressor plasmid pREP4 were 
transformed into the expression host B834(DE3) to yield B834(DE3)/pQE15- 
MRS/pREP4. Plasmid DNA from all B834(DE3)/pQE15-MRS/pREP4 cultures used for 
protein expression experiments was sequenced to confirm that it encoded wild-type 
MetRS. The expression plasmid pQE15-MRS (SEQ ID NO.: 1) encodes MetRS under 

15 control of the E. coli promoter metG pi (Genbank accession number X55791) (F. Dardel, 
M. Panvert, G. Fayat, Mol Gen. Genet 1990, 223, 121-133) as well as the target protein 
murine dihydrofolate reductase (mDHFR) under control of a bacteriophage T5 promoter. 
The expression plasmid also encodes an N-terminal hexahistidine sequence for mDHFR 
which permits purification of the target protein by immobilized metal chelate affinity 

20 chromatography (The Qiagen Expressionist, 1992, p. 45). Furthermore, mDHFR 
contains 8 methionine residues which can be replaced by methionine analogues. A 
control bacterial host, which produces only mDHFR and normal cellular levels of MetRS, 
was prepared by transforming B834(DE3) with pREP4 and pQEl 5. 

25 Similarly, a bacterial host capable of overexpressing MetRS was produced by 
transforming E. coli strains CAG18491 (Novagen, Inc., Madison, WI, USA), a 
methionine auxotroph, with repressor plasmid pREP4 and expression plasmid pQE15- 
MRS, as described for the B834(DE3) strain. A control bacterial host, which produces 
only mDHFR and normal cellular levels of MetRS, was prepared by transforming 

30 CAG18491with pREP4 and pQE15. 
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Methionine analogues 2-13 were tested for translational activity in both bacterial hosts. 
Methionine analogues were synthesized via alkylation of diethylacetamidomalonate, as 
previously described (As described in Example I, infra\ J. C. M. van Hest, D. A. Tirrell, 
FEBS Lett. 1998, 428, 68-70). Cultures of B834(DE3)/pQE15-MRS/pREP4 or 
5 B834(DE3)/pQE15/pREP4 or CAG18491/pQE15-MRS/pREP4 or 

CAG18491/pQE15/pREP4 in M9AA media were grown to an optical density of 0.90, and 
the cells were sedimented by centrifugation. The M9AA medium was prepared by 
supplementing sterile M9 medium with 60 mg/ml of each of the amino acids, 1 mM 
MgS04> 0.2 wt% glucose, 1 mg/ml thiamine chloride, and 1 mg/ml calcium chloride. 

10 The antibiotics ampicillin and kanamycin were added at concentrations of 200 mg/1 and 
35 mg/1, respectively. 

Cells were washed three times with M9 salts and resuspended to an optical density of 
0.90 in M9 test media containing 19 amino acids plus 1) neither methionine nor analogue 

15 (negative control); 2) methionine (60 mg/liter, positive control); or 3) an analogue of 
interest (60 mg/liter). To test the effect of increasing the level of supplementation of the 
analogues, a set of experiments was also conducted in which the medium was 
supplemented with 500 mg/liter of methionine or the amino acid analogue. Expression of 
mDHFR was induced by addition of 0.4 mM isopropyl p-D-thiogalactopyranoside 

20 (IPTG), and protein synthesis was monitored after 4.5 hours. Expression of mDHFR was 
monitored by SDS-polyacrylamide gel electrophoresis (SDS-PAGE); accumulation of 
target protein was taken as evidence for translational activity of the methionine analogue. 

For cultures supplemented with amino acids at 60 mg/liter, the target protein was not 
25 observed in the negative control culture of B834(DE3)/pQE15/pREP4, 
CAG18491/pQE15/pREP4 or in cultures supplemented with Ccg (4), 6,6,6-trifluoro-2- 
aminohexanoic acid (6), 2-aminoheptanoic acid (7), norvaline (8) o-allylserine (10), 
allylgylcine (12) or propargylglycine (13). In contrast, mDHFR was detected in both 
bacterial host cultures supplemented with methionine (1), Hag (2), Hpg (3), and 
30 norleucine (9), as indicated by the appearance of a protein band at the position expected 
for mDHFR in SDS-PAGE. 
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For the negative control cultures and for cultures supplemented with Teg however, the 
behavior of the bacterial hosts differed, as shown in Figure 7. mDHFR was not detected 
in the B834(DE3)/pQE15/pREP4 culture supplemented with Teg, while strong induction 
5 of mDHFR was observed for B834(DE3)/pQE15-MRS/pREP4 under the same 
conditions. Even the unsupplemented control culture of B834(DE3)/pQE15- 
MRS/pREP4 shows evidence of mDHFR synthesis, suggesting that introduction of 
pQE15-MRS (SEQ ID NO.: 1) does indeed increase the rate of activation of methionine 
in the modified host. 

10 

B834(DE3)/pQE15-MRS/pREP4 cells, which overexpress MetRS, have sufficient MetRS 
activity to synthesize measurable levels of protein from the very low intracellular levels 
of methionine in the negative control culture. Interestingly, aminoacyl-tRNA synthetase 
overexpression is induced by amino acid starvation in some gram-positive bacteria, 
15 presumably to permit continued protein synthesis (D. Luo, J. Leautey, M. Grunberg- 
Manago, H. Putzer, 1 Bacteriol 1997, 179, 2472-2478). B834(DE3)/pQE15/pREP4 
cultures, which lack the increased MetRS activity, do not show background expression of 
protein in negative control cultures. 

20 Similar results were observed for the CAG18491/pQE15/pREP4 and CAG18491/pQE15- 
MRS/pREP4 cultures supplemented with 11 (Figure 8). While mDHFR was not detected 
in the CAG18491/pQE15/pREP4 cultures supplemented with 11, strong induction of 
mDHFR was observed for CAG18491/pQE15-MRS/pREP4 under the same conditions. 
The unsupplemented control culture of CAG18491/pQE15-MRS/pREP4 showed little 

25 evidence of mDHFR synthesis, which may be due to lower levels of MetRS activity in 
these cell strains versus that in the B834(DE3)/pQE15-MRS/pREP4. 

For cultures supplemented with amino acids at 500 mg/liter, however, the target protein 
mDHFR could be observed for certain amino acid analogues only in cultures of cellular 
30 hosts containing the MetRS (Figure 9). Figure 9 demonstrates that the modified bacterial 
hosts CAG18491/pQE15-MRS/pREP4 are able to produce the target protein in cultures 
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supplemented with 500 mg/liter of 4, 7, 8, and 12. Quantitative characterization of the 
kinetic parameters of these analogues demonstrates that although the analogues do not 
support measurable levels of PPj exchange after 20 minutes, they are activated by the 
MetRS in vitro (Figure 10). Due to the very slow rate of activation supported by these 
5 analogues, increasing the concentration of the analogues in the medium must be 
combined with introduction of pQE15-MRS (SEQ ID NO.: 1) into the bacterial host in 
order to raise the rate of activation of these analogues sufficiently to permit protein 
biosynthesis. 

To confirm the supposition that the MetRS activity of a cellular host is increased by the 
introduction of pQE15-MRS, direct measurement of the MetRS activities of whole cell 
lysates was conducted. B834(DE3)/pQE15-MRS/pREP4 exhibits a V max for methionine 
activation approximately 30-fold higher than that observed for the control host 
B834(DE3)/pQE15/pREP4 (Figure .11). Similarly, CAG18491/pQE15-MRS/pREP4 
exhibits a V max for methionine activation approximately 50-fold higher than that observed 
for the control host CAG18491/pQE15/pREP4. ATP-PPi exchange assays were 
conducted using the methods as described supra. A 50-|il aliquot of whole cell lysate 
with a normalized OD600 of 20 was prepared by one freeze-thaw cycle and added to the 
assay mixture to yield a final volume of 150 pi. A saturating concentration of methionine 
(750 pM) was used to determine the maximum exchange velocity for each cell lysate. 

These results show clearly that increasing the MetRS activity of the host is necessary and 
sufficient to observe translational activity of 4, 5, 7, 8, 11, and 12 under convenient 
conditions in vivo. Protein yields (mDHFR-Tcg) of approximately 8.5 mg/liter were 
25 observed for B834(DE3)/pQE15-MRS/pREP4 cultures supplemented with Teg, 
compared with yields of approximately 35 mg/liter for both B834(DE3)/pQE15- 
MRS/pREP4 and B834(DE3)/pQE15/pREP4 cultures supplemented with methionine. 
Amino acid analysis of protein containing Teg shows a decrease in methionine content to 
0.3 mol% from the expected value of 3.8 mol%. It was not possible to detect Teg directly 
30 by amino acid analysis, owing to instability of the analogue under the analysis conditions. 
If depletion of methionine is assumed to result from replacement by Teg, the observed 
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analysis corresponds to an overall extent of incorporation of the analogue of 91±2%. 
Amino acid analysis of mDHFR containing the other analogues (4, 7, 8, 11, and 12), 
showed 92-98% replacement of methionine. 

5 A direct assessment of the extent of incorporation of Teg into mDHFR was provided by 
NMR spectroscopy. Proton NMR spectra were recorded using a Varian Inova NMR 
spectrometer with proton acquisition at 599.69 MHz. Spectra were recorded at 25°C 
overnight. A simple presaturation pulse was used for water suppression. Comparisons of 
the 600 MHz proton NMR spectra (Figure 12) of mDHFR, Teg, and mDHFR-Tcg 

10 indicate the appearance, in the mDHFR-Tcg spectrum (Figure 12c), of the Teg vinylene 
protons at 5= 5.35 (5-CH) and 5.60-5.70 (y-CH). The resonances at S= 5.35 and 5= 
5.70 occur at the same chemical shift values as in free Teg and are clearly due to 
incorporation of Teg into mDHFR. That the resonance at 8 = 5.60 arises from the y-CH 
vinylene proton of Teg is suggested by the fact that the integrated intensity of the 

15 resonance at 8= 5.35 equals the sum of the integrations of the resonances at 8= 5.60 and 
8= 5.70. This assignment is confirmed by ID TOCSY (Total Correlation Spectroscopy) 
experiments which indicate that the protons at both 8- 5.60 and 8= 5.70 are members of 
the same spin system (and therefore the same amino acid) as those at 8- 5.35. More 
importantly, the ID TOCSY experiments also show that the protons at 8= 5.35 (and 

20 therefore those at 8= 5.60 and 8= 5.70) are associated with the spin system of the entire 
Teg side chain (ID TOCSY spectra were recorded on a Varian Inova NMR spectrometer 
with proton acquisition at 599.69 MHz). 

A ID TOCSY pulse sequence (D. Uhrin, P. N. Barlow, X Magn. Reson. 1997, 126, 248- 
25 255) with selective irradiation of the resonance at £=5. 35 (E. Kupce, J. Boyd, I. D. 
Campbell, J. Magn. Reson. Ser. B. 1995, 106, 300-303) was used to identify which 
protons belonged to the spin system of the 8=5. 35 resonance. The selectivity of the 
pulse is demonstrated in a separate, simple ID experiment in which the selective pulse 
was applied alone; no other resonances were observed in the spectrum under these 
30 conditions. Observation after a mixing time of 60 ms, however, showed the protons at 
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£=5.60 and 5=5 JO, indicating that those protons are members of the same spin 
system (and therefore the same amino acid residue) as those corresponding to the 
resonance at 5- 5.35. The ot-carbon and side chain (i- and e-carbon protons were also 
observed at chemical shift values characteristic of the free amino acid (5=4.3 (a-CH), 
5 2.5 (P-CH2), and 1.6 (5-CH3)). Integration of the spectrum suggests that 5 of the 8 
methionine positions (occupied by Teg) are represented by the resonance at 5= 5.60; 
these protons must reside in a magnetically-distinct environment from the protons at 8 = 
5.70. These results unequivocally demonstrate the translational activity of Teg in the 
host strain outfitted with elevated MetRS activity. Integration of the NMR spectrum 
1 0 indicates 90±6% replacement of methionine. 

Retention of the N-terminal (initiator) methionine in mDHFR was expected on the basis 
of the identity of the penultimate amino acid (P. H. Hirel, J. M. Schmitter, P. Dessen, G. 
Fayat, S. Blanquet, Proc. Natl Acad. Sci. USA 1989, 86, 8247-8251), so N-terminal 

15 sequencing provided a third means of assessing the extent of replacement of methionine 
by analogues 4, 5, 7, 8, 11, and 12. Because the analogues were not degraded under the 
analysis conditions, they could be detected directly. Comparison of chromatograms of 
the N-terminal residues of mDHFR and mDHFR-Tcg (Figure 13) demonstrate that the 
methionine that normally occupies the initiator position of mDHFR (Figure 13 a) was 

20 nearly completely replaced with Teg (Figure 13b) in mDHFR-Tcg (Figure 13c). The 
signal corresponding to methionine eluted at 13.8 min while that corresponding to Teg 
eluted at 16.0 min. The large peaks (pptu) which elute at approximately 15.4 min 
correspond to piperidylphenylthiourea, a product of the analysis resulting from the buffer, 
and the small peak (diet) at 19.4 min corresponds to diethylphthalate, an internal 

25 standard. These results clearly indicate the incorporation of Teg at the initiator site of 
mDHFR-Tcg and corroborate the NMR results. Integration of the peak areas 
corresponding to Teg and to methionine indicates 96±2% incorporation of the analogue at 
the initiator position. Similar analysis of mDHFR-2bg (2-butynylglycine) by N-terminal 
sequencing indicates 98 ± 2% replacement of methionine (Figure 14). Analysis of 

30 mDHFR containining analogues 8 and 12 shows replacement of methionine by these 
analogues at levels of 85-90%. 
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The incorporation of 4, 5, 7, 8, 11, and 12 into proteins in vivo constitutes the first 
example of broadening the amino acid substrate range of the E. coli translational 
apparatus via overproduction of MetRS in a bacterial host. The utilization of 4, 5, 7, 8, 
5 11, and 12 in all stages of protein synthesis (including initiation) indicates the 
appropriateness of targeting the aminoacyl-tRNA synthetase in studies aimed at in vivo 
incorporation of amino acid analogues into proteins. Transport into the cell, recognition 
by methionyl-tRNA formylase, and recognition by the elongation factors and the 
ribosome are less likely to be limiting factors. 

10 

These results indicate that this simple strategy of overexpression of aminoacyl-tRNA 
synthetase may be used to modify proteins by incorporation of amino acid analogues that 
are poor substrates for aminoacyl-tRNA synthetase and that would be essentially inactive 
in conventional expression hosts. 

15 

Overexpression of mutant forms of the aminoacyl-tRNA synthetase prepared via site- 
directed mutagenesis or directed evolution (F. H. Arnold, J. C. Moore, Adv. Biochem. 
Eng. Biotech 1997, 58, 1-14; F. H. Arnold, Chem. Eng. Sci. 1996, 51, 5091-5102) should 
provide additional strategies for incorporating amino acid analogues into proteins in vivo. 

20 

The results reported here also suggest new opportunities for macrbmolecular synthesis 
via protein engineering. The versatile chemistry of unsaturated functional groups (B. M. 
Trost, L Fleming,, Comprehensive Organic Synthesis, Pergamom Press, Oxford, 1991) 
can be used to control protein structure and function through chemical derivatization, an 

25 especially intriguing possibility in this case given the important role of methionine in 
protein-protein recognition processes. For example, ruthenium-catalyzed olefin 
metathesis (D. M. Lynn, B. Mohr, R. H. Grubbs, J. Am. Chem. Soc. 1998, 120, 1027- 
1028; E. L. Dias, T. N. SonBinh, R. H. Grubbs, J. Am. Chem. Soc. 1997, 119, 3887-3897) 
of homoallylglycine (T. D. Clark, M. R. Ghadiri, J. Am. Chem. Soc. 1995, 117, 12364- 

30 12365) and o-allylserine (H. E. Blackwell, R. H. Grubbs, Angew. Chem. 1998, 110, 3469- 
3472; Angew. Chem. Int. Ed. 1998, 37, 3281-3284) side chains has been used to produce 
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covalently-modified peptide structures of various kinds. The incorporation of Teg may 
be singularly useful in this regard as the internal olefin is active in aqueous-phase ring 
closing metathesis reactions, whereas terminally-unsaturated groups (such as those 
previously used to replace methionine in vivo) are not (T. A. Kirkland, D. M. Lynn, R. H. 
5 Grubbs, J. Org Chem. 1998, 63, 9904-9909). 

EXAMPLE III 

This example demonstrates that activation of methionine analogues in vitro correlates 
10 well with the ability of these analogues to support protein synthesis in vivo, substantiating 
the critical role of aminoacyl-tRNA synthetase in controlling the incorporation of amino 
acid analogues into proteins. 

Reagents 

15 

Each of the analogues 2-7 and 11 (Figure 2) was prepared by alkylation of diethyl 
acetamidomalonate with the appropriate tosylate followed by decarboxylation and 
deprotection of the amine function (as described in Example I). Methionine and 
analogues 8, 9, 12 and 13 were obtained from Sigma (St. Louis, Mo). Radiolabeled 

20 sodium pyrophosphate was purchased from NEN Life Science Products, Inc., and 
isopropyl-p-D-thiogalactopyranoside was obtained from Calbiochem. The RGS-His 
antibody and anti-mouse IgG horseradish peroxidase conjugate used for Western blotting 
procedures were obtained from Qiagen and Amersham Life Sciences, respectively. All 
other reagents used during protein biosynthesis and purification and for activation assays 

25 were commercially available from Sigma, Aldrich, and Qiagen, and were used as 
received. 

In vitro activation assays 

30 The fully active, truncated form of MetRS was purified from overnight cultures of E, coli 
JM101 cells carrying the plasmid pGG3 (Kim, H.Y.; Ghosh, G.; Schulman, L.H.; Brunie, 
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S.; Jakubowski, H. Proc. Natl Acad. Set USA 1993, 90, 11553-11557), by using size 
exclusion methods previously reported (Mellot, P.; Mechulam, Y.; LeCorre, D.; 
Blanquet, S.; Fayat, G. J. Mol Biol 1989, 208, 429-443). Purified enzyme solutions (in 
10 mM phosphate, pH 6.7, 10 mM p-mercaptoethanol) were concentrated to at least 3 
5 jaM prior to their storage in 40% glycerol at -20°C. Concentrations of enzyme stocks 
were determined by the Bradford method, using samples of MetRS quantified by amino 
acid analysis as standards. 

Activation of methionine analogues by MetRS was assayed via the amino acid-dependent 
10 ATP-PPj exchange reaction at room temperature, also as previously described (Mellot, P.; 

Mechulam, Y.; LeCorre, D.; Blanquet, S.; Fayat, G. J. Mol Biol 1989, 208, 429-443; 

Ghosh, G.; Pelka, H.; Schulman, L.D. Biochemistry 1990, 29, 2220-2225). The assay, 

which measures the 32 P-radiolabeled ATP formed by the enzyme-catalyzed exchange of 

32 P-pyrophosphate (PPj) into ATP, was conducted in 150 (al of reaction buffer (pH 7.6, 20 
15 mM imidazole, 0.1 mM EDTA, 10 mM P-mercaptoethanol, 7 mM MgCl 2 , 2 mM ATP, 

0.1 mg/ml BSA, and 2 mM PPj (in the form of sodium pyrophosphate with a specific 

activity of approximately 0.5 TBq/mole)). 

Kinetic parameters for methionine analogues 2, 3, 5, 9 and 11 were obtained with an 
20 enzyme concentration of 75 nM and analogue concentrations of 100 jjM to 20 mM. 
Parameters for methionine were obtained by using methionine concentrations ranging 
from 10 \xM to 1 mM. Aliquots (20 jil) were removed from the reaction mixture at 
various time points and quenched in 0.5 ml of a solution comprising 200 mM PP„ 7% 
w/v HCIO4, and 3% w/v activated charcoal. The charcoal was rinsed twice with 0.5 mL 
25 of a 10 mM PPj, 0.5% HCIO4 solution and then resuspended in 0.5 mL of this solution 
and counted via liquid scintillation methods. Kinetic constants were calculated by a 
nonlinear regression fit of the data to the Michaelis-Menten model. 
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In vivo incorporation of amino acid analogues 

Buffers and media were prepared according to standard protocols (Ausubel, F.M.; Brent, 
R.; Kingston, R.E.; Moore, D.D.; Seidman, J.G.; Smith, J.A.; Struhl, K., Eds. Current 
5 Protocols in Molecular Biology; John Wiley and Sons: New York, 1998). The E. coli 
methionine auxotroph CAG18491 (X, rph-1, metE3079.:Tn!0) was transformed with 
plasmids pQE15 and pREP4 (Qiagen), to obtain the expression host 
CAG18491/pQE15/pREP4. The auxotroph was transformed with the plasmids pQE15- 
MRS (SEQ ID NO.: 1) and pREP4 to obtain the modified bacterial expression host 
10 CAG18491/pQE15-MRS/pREP4. Both bacterial expression hosts produce the target 
protein mDHFR under control of a bacteriophage T5 promoter; the modified host also 
expresses extra copies of the MetRS gene under control of the constitutive metG pi 
promoter (Dardel, R; Panvert, M.; Fayat, G. Mol Gen, Genet 1990, 225, 121-133). 

1 5 Protein expression (1 liter scale). 

Similar procedures were used for preparation and isolation of mDHFR from media 
supplemented with the L-isomers of 1, 2, 3, or 9. M9AA medium (100 mL) supplemented 
with 1 mM MgS04 0.2w% glucose, 1 mg/L thiamine chloride and the antibiotics 

20 ampicillin (200 mg/L) and kanamycin (35 mg/L) was inoculated with the appropriate E. 
coli strain (CAG18491/pQE15/pREP4 or CAG18491/pQE15-MRS/pREP4) and grown 
overnight at 37°C. This culture was used to inoculate 900, mL M9AA medium 
supplemented as described. The cells were grown to an optical density at 600 nm 
(ODeoo) of approximately 0.9 and a medium shift was performed. The cells were 

25 sedimented for 10 min at 3030 g at 4°C, the supernatant was removed, and the cell pellet 
was washed twice with 600 mL of M9 medium. Cells were resuspended in 1 000 mL of 
the M9AA medium described above, without methionine, and supplemented with 20 
mg/L of the L-isomer of either 1, 2, 3, or 9. Protein synthesis was induced by addition of 
isopropyl-P-D-thiogalactopyranoside (IPTG) to a final concentration of 0.4 mM. 

30 Samples (1 mL) were collected after 4.5 hours, the OD600 measured, and cells 
resuspended with distilled water to yield a normalized OD600 of 20. Protein expression 
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was monitored by SDS polyacrylamide gel electrophoresis (12% acrylamide running 
gel); accumulation of mDHFR could be observed at an apparent molar mass of 
approximately 28 kDa after Coomassie staining. 

5 Protein purification. 

Approximately 4.5 h after induction, cells were sedimented (9,800 g, 10 min, 4°C) and 
the supernatant was removed. The pellet was placed in the freezer overnight. The cells 
were thawed for 30 min at 37°C, 30 mL of buffer (6 M guanidine-HCl, 0.1 M NaH 2 P0 4 , 

10 0.01 M Tris, pH 8) was added and the mixture was shaken at room temperature for 1 h. 
The cell debris was sedimented (15,300 g, 20 min, 4°C) and the supernatant was 
subjected to immobilized metal affinity chromatography (Ni-NTA resin) according to the 
procedure described by Qiagen (The Qiagen Expressionist; Qiagen; Valencia, CA, 2000). 
The supernatant was loaded on 10 mL of resin which was then washed with 50 mL of 

15 guanidine buffer followed by 25 mL of urea buffer (8 M urea, 0.1 M NaH2P0 4 and 0.01 
M Tris, pH 8). Similar urea buffers were used for three successive 25 mL washes at pH 
values of 6.3, 5.9 and 4.5, respectively. Target protein was obtained in washes at pH 5.9 
and 4.5. These washes were combined and dialyzed (Spectra/Por membrane 1, 
MWCO=6-8 kDa) by batchwise dialysis against doubly distilled water for 4 days with at 

20 least 12 total changes of water. The dialysate was lyophilized to a purified powder of 
mDHFR. Experiments in M9AA medium afford approximately 30 mg of mDHFR for 
each of the bacterial expression hosts, while a control experiment in 2xYT medium 
afforded approximately 60 mg of mDHFR. Protein yields are reported as mg protein 
obtained per liter of bacterial culture; approximately 5-6 g of wet cells are obtained per 

25 liter of culture regardless of the identity of the analogue used to supplement the medium. 

Protein expression (5 mL scale). 

M9AA medium (50 mL) supplemented with 1 mM MgSC>4 5 0.2 wt% glucose, 1 mg/L 
30 thiamine chloride and the antibiotics ampicillin (200 mg/L) and kanamycin (35 mg/L) 
was inoculated with 5 mL of an overnight culture of the appropriate bacterial expression 
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host. When the turbidity of the culture reached an OD600 of 0.8, a medium shift was 
performed. The cells were sedimented for 10 min at 3030 g at 4°C, the supernatant was 
removed, and the cell pellet was washed twice with 25 mL of M9 medium. Cells were 
resuspended in 50 mL of the M9AA medium described above, without methionine. Test 
5 tubes containing 5 mL aliquots of the resulting culture were prepared, and were 
supplemented with 10 [iL of 10 mg/mL L-methionine (1) (positive control), L- 
homoallylglycine (2), L-homopropargylglycine (3), or L-norleucine (9), respectively. A 
culture lacking methionine (or any analogue) served as the negative control. Protein 
expression was induced by addition of IPTG to a final concentration of 0.4 mM. After 4 
10 h, the OD600 was measured, and the samples were sedimented. After the supernatant was 
decanted, the cell pellets were resuspended in distilled water to yield a normalized OD of 
20. 

Protein expression was monitored by SDS polyacrylamide gel electrophoresis (12% 
15 acrylamide running gel), followed by Western blotting. After transfer to a nitrocellulose 
membrane, Western blots were developed by treatment with a primary RGS-His 
antibody, followed by treatment with a secondary anti-mouse IgG conjugated to 
horseradish peroxidase to provide detection by chemiluminescence. Films were checked 
to ensure that band intensity was not saturated. Levels of protein synthesis were estimated 
20 by the intensity of the band on the gel, as determined using a Pharmacia Ultrascan XL 
laser densitometer and analysis by Pharmacia GelScan XL software. The accumulation 
of target protein is taken as evidence for incorporation of the amino acid analogue, as 2, 
3, 5, 9 and 11 have been shown to replace methionine, even in modified bacterial hosts, at 
levels of 92-98% (van Hest, J.C.M.; Tirrell, D.A. FEBS Lett 1998, 428, 68-70; as 
25 described in Example I, infra; Budisa, N.; Steipe, B.; Demange, P.; Eckerskorn, C; 
Kellermann, J.; Huber, R. Eur. J. Biochem. 1995, 230, 788-796). 

Results and Discussion 

30 Studies with methionine analogues 2-13 (as described in Example I, infra), 
demonstrated that 2 and 3 can be incorporated into proteins with extents of substitution 
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up to 98%. The incorporation of 9 had been previously reported (Budisa, N.; Steipe, B.; 
Demange, P.; Eckerskorn, C; Kellermann, J.; Huber, R. Eur. J. Biochem. 1995, 230, 788- 
796). In contrast, 4 - 8 and 10-13 do not support protein synthesis in the absence of 
methionine in a conventional bacterial expression host; investigation of the activation of 
5 the analogues by methionyl-tRNA synthetase (MetRS) indicated that 4-8 and 10-13 are 
not efficiently activated by the enzyme. 

Overproduction of MetRS in the bacterial host, however, permits incorporation of 4, 5, 7, 
8, 11 and 12, which show very slow exchange of PPj in in vitro activation assays 
10 (Example II, infra). These results indicate that the aminoacyl-tRNA synthetase are 
appropriate targets for studies aimed at the incorporation of amino acid analogues into 
proteins in vivo. The results also suggest that neither transport into the cell nor 
recognition by the elongation factors or the ribosome limits the incorporation of these 
amino acid analogues into proteins in vivo, 

15 

In this example, the in vitro activation of 2 - 13 by MetRS was characterized in order to 
determine the roles of the synthetase in controlling analogue incorporation and protein 
yield in media supplemented with amino acid analogues. Furthermore, the analogues 2 
and 3, which replace methionine in vivo, may be useful for chemical modification of 

20 proteins by olefin metathesis (Clark, T.D.; Kobayashi, K.; Ghadiri, M.R. Chem. Eur. J. 
1999, 5, 782-792; Blackwell, H.E.; Grubbs, R.H. Angew. Chem. Int. Ed Engl 1998, 37, 
3281-3284), palladium-catalyzed coupling (Amatore, C; Jutand, A. J. Organomet. 
Chem. 1999, 576, 255-277; Tsuji, J. Palladium Reagents and Catalysts: Innovations in 
Organic Synthesis; John Wiley and Sons: New York, 1995; Schoenberg, A.; Heck, R.F. 

25 J. Org. Chem. 1974, 39, 3327-3331), and other chemistries (Trost, B.M.; Fleming, L, 
Eds. Comprehensive Organic Synthesis; Pergamon Press: Oxford, 1991). 

The attachment of an amino acid to its cognate tRNA proceeds in two steps (Figure 18). 
Activation, the first step, involves the enzyme-catalyzed formation of an aminoacyl 
30 adenylate (designated aa~AMP in Figure 1 8) and can be examined by monitoring the rate 
of exchange of radiolabeled pyrophosphate ( 32 P-PPj ) into ATP (Fersht, A. Structure and 
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Mechanism in Protein Science; W. H. Freeman and Company: New York, 1999). 
Aminoacylation, the second step, can be evaluated by monitoring the amount of 
radiolabeled amino acid attached to tRNA in the presence of the enzyme. Because initial 
recognition of an amino acid by its aminoacyl-tRNA synthetase is perhaps the most 
5 critical step in the incorporation of amino acid analogues into proteins in vivo. Thus, the 
focus has been on the in vitro activation of methionine analogues by MetRS was 
evaluated and the results compared to those obtained in studies of in vivo incorporation. 

The rates of activation of 2 - 13 by MetRS were determined by the ATP-PPj exchange 
10 assay, and were found to correlate well with the results of the in vivo studies; analogues 
2- 5, 7-9, 11 and 12 (those which had been shown to support protein synthesis) exhibited 
measurable exchange of PPi (Example I and II, infra). The kinetic parameters kcat and K m 
were determined for each of these analogues; the results for analogues 2, 3, 5, 9 and 11 
are summarized in Figure 15. The measured K m for methionine matched previously 
15 reported values (Kim, H.Y.; Ghosh, G.; Schulman, L.H.; Brunie, S.; Jakubowski, H. 
Proc. Natl Acad. Sci. USA 1993, 90, \ 1553-1 1557). The value determined for kc at was 
slightly lower than the literature value. Comparison of the kc a t/K m values for each of the 
analogues with that for methionine showed that these analogues were 500-fold to 13825- 
fold poorer substrates for MetRS than methionine. 

20 

Figure 15 also demonstrates that methionine analogues that are activated up to 2000- fold 
more slowly by MetRS than methionine can support protein synthesis in a conventional 
bacterial host in the absence of methionine. (Poorer substrates, such as 5, require 
modification of the MetRS activity of the bacterial host in order to support protein 

25 synthesis (Example II, infra). These results are comparable to those reported previously 
for the activation and in vivo incorporation of phenylalanine analogues (Gabius, H.J.; von 
der Haar, F.; Cramer, F. Biochemistry 1983, 22, 2331-2339; Kothakota, S.; Mason, T.L.; 
Tirrell, D.A.; Fournier, M.J. J. Am. Chem. Soc. 1995, 777, 536-537; Ibba, M.; Kast, P.; 
Hennecke, H. Biochemistry 1994, 55, 7107-7112). Comparisons for other amino acids 

30 have been limited by a lack of in vitro activation data. The data suggested that amino 
acid analogues can support protein synthesis in vivo even with surprisingly inefficient 
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activation of the amino acid by its aminoacyl-tRNA synthetase. Activation of methionine 
analogues by MetRS governs their ability to support protein synthesis in vivo. 

Based on these results, it seemed likely that the kinetics of analogue activation would 
5 limit the rate and yield of protein synthesis in bacterial cultures supplemented with 
methionine analogues that are poor substrates for MetRS. This correlation was 
investigated by comparing the kinetic constants for analogue activation by MetRS with 
the yield of the target protein murine dihydrofolate reductase (mDHFR) obtained from 1- 
liter cultures of the bacterial host CAG18491/pQE15/pREP4. 

10 

The CAG18491/pQE15/pREP4 bacterial host was produced by transforming the E. coli 
methionine auxotroph CAG 18491 with the expression plasmid pQE15 and the repressor 
plasmid pREP4. The expression plasmid pQE15 encodes mDHFR under control of a 
bacteriophage T5 promoter and an N-terminal hexahistidine sequence that permits 
1 5 purification of the target protein by immobilized metal chelate affinity chromatography 
{The Qiagen Expressionist] Qiagen; Valencia, CA, 2000). 

The kinetic constants for analogue activation and the corresponding protein yields are 
listed in Figure 15 and shown in Figure 16. Analogues with the highest kc at /K m values 
20 also support the highest levels of protein synthesis; the protein yields scale remarkably 
well with k cat /K m? at least for the poorer substrates. Analogue 3 supported protein 
synthesis with yields equivalent to those obtained with methionine, despite the fact that 3 
is a 500-fold poorer substrate for MetRS than methionine. 

25 Bacterial cultures supplemented with 9 (1050-fold lower kcat/K m ) produce 57% as much 
mDHFR as cultures supplemented with methionine, and cultures supplemented with 2 
(1850-fold lower k cat /K m ) produce 28% of the control yield of protein. Bacterial cultures 
supplemented with 5 (4700-fold lower k ca t/K m ) and 11 (13825-fold lower k cat /K m ) did not 
support measurable levels of protein synthesis in this expression host; however, bacterial 

30 hosts exhibiting approximately 30-fold higher MetRS activity produce 23% as much 
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mDHFR in cultures supplemented with 5 as cultures supplemented with methionine 
(Example II 3 infra). 

These results demonstrate that the rate of methionine analogue activation in vitro does 
5 indeed correlate with protein yield in vivo. The results suggest that the kinetics of 
activation can play a critical role in controlling the rate of protein synthesis in 
methionine-depleted cultures supplemented with analogues that are poor substrates for 
MetRS. 

10 Protein yields obtained from bacterial cultures supplemented with methionine analogues 
might be improved by increasing the MetRS activity of the bacterial host. To test this, 
the yields of protein prepared were compared in the conventional bacterial expression 
host, CAG18491/pQE15/pREP4, to those obtained from a modified host, 
C AG 1 849 1 /pQE 1 5-MRS/pREP4 . 

The modified CAG1 8491/pQEl 5-MRS/pREP4 host was prepared by transforming E. coli 
strain CAG18491 with the expression plasmid pQE15-MRS (SEQ ID NO.: 1) (Example 
II, infra) and the repressor plasmid pREP4. The expression plasmid pQE15-MRS (SEQ 
ID NO.: 1) encodes MetRS under control of the E. coli promoter metG pi (Genbank 
20 accession number X55791) (Dardel, F.; Panvert, M.; Fayat, G. Mol Gen. Genet. 1990, 
225, 121-133) as well as the target protein mDHFR. The MetRS activity of the bacterial 
hosts was determined as previously described (Example II, infra), with the modified host 
exhibiting 50-fold higher MetRS activity than the conventional strain. 

25 Protein synthesis was monitored for 5-ml cultures of these hosts supplemented with 
methionine or analogues 2, 3, or 9. Western blot analyses of protein synthesis are shown 
in Figure 17. Although very low levels of protein synthesis were observed for negative 
control cultures of CAG18491/pQE15-MRS/pREP4, amino acid analyses, N-terminal 
sequencing, and NMR analyses of proteins produced in cultures of the modified host 

30 supplemented with 5 and 11 (the poorest of the substrates) still showed 90-96% 
replacement of methionine by 5 and 11 (Example II, infra). Thus, the level of protein 
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synthesis shown in Figure 17 resulted from the incorporation of the analogue and was not 
due to incorporation of residual methionine. 

For cultures supplemented with methionine or 3, the modified host, CAG18491/pQE15- 
5 MRS/pREP4, does not exhibit higher levels of protein synthesis than the conventional 
host CAG18491/pQE15/pREP4. Analysis by laser densitometry confirmed these results, 
and revealed approximately equal accumulation of target protein for both strains; 
identical results have been obtained for large-scale expressions and purification of 
mDHFR. Activation of the analogue by MetRS does not appear to limit protein synthesis 

10 in cultures supplemented with 3. For cultures supplemented with 2 or 9, however, the 
modified bacterial host exhibits significantly increased levels of protein synthesis in 
comparison with the conventional host. Laser densitometry analysis indicates that the 
level of protein synthesis in the modified host is increased approximately 1.5-fold over 
that in the conventional host for cultures supplemented with 2, and approximately 1.4- 

15 fold for cultures supplemented with 9. Activation of these analogues by MetRS appears 
to limit protein synthesis in the conventional host, such that increasing the MetRS activity 
of the host is sufficient to restore high levels of protein synthesis. Preliminary results 
indicated that the yield of mDHFR obtained from large-scale cultures of 
CAG18491/pQE15-MRS/pREP4 supplemented with 2 or 9 are increased to 

20 approximately 35 mg/L (from 10 mg/L obtained from cultures of 
CAG18491/pQE15/pREP4 (Figure 15)). 

The results indicate that overexpression of MetRS can improve protein yields for cultures 
supplemented with methionine analogues that are poor substrates for MetRS, and provide 
25 an attractive general method for efficient production of chemically novel protein 
materials in vivo. 

Quantitative assessment of the kinetics of activation by MetRS have indicated that even 
very poor substrates for the synthetase can be utilized by the protein synthesis machinery 
30 of a bacterial expression host. The correlation, shown herein, between the in vitro and in 
vivo results indicates the important role of the aminoacyl-tRNA synthetase and suggests 
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that site-directed mutatgenesis and/or directed evolution of this class of enzymes may be 
used to increase further the number of amino acid analogues that can be incorporated into 
proteins in vivo. 

These results also indicate that the kinetics of activation of methionine analogues by 
MetRS in vitro correlate with the level of protein synthesis supported by the analogues in 
vivo. The activity of the MetRS in the bacterial host can be manipulated, by 
overexpression of the MetRS, to improve the yields of proteins containing methionine 
analogues that are poor substrates for the MetRS. Overexpression of aminoacyl-tRNA 
synthetase used to improve yields of proteins containing other amino acid analogues, as 
well as proteins rich in particular natural amino acids. Manipulation of the aminoacyl- 
tRNA synthetase activities of a bacterial host broadens the scope of protein engineering, 
by permitting production of natural and artificial proteins, with novel chemical and 
physical properties. 
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